SEQ ID NO: 6

TGCCGCCTTC TTTGATATTC ACTCTGTTGT ATTTCATCTC TTCTTCCCGA TCAAACGATA 60 
TAACAGTCTG TATAACAGTC TGTGAGGAAA TAC7TGGTAT TTCTTCTCAT CACTGTTTTT 120 
ATAAGTAATG TTSAATATTG GATAAGCCTG. . TGTGTCCTTT CTCTTOGGAS ACAAAGCCCA 180 
CAGCAGGTGG TGSTTGCOCT GGTGGCAGCT CAC7GACAGG AGAGGTTTTT TTGCCTGTTT 240 

rmrrrm TT nn nrr aactaacctc ttcttttttc ttagtaaatt ttctactgga 300

CTCTA TG TTI TGACAGCTCA GAAACATTTC TTCAAAAGAA CAACCTTTT C CAAACTCTAC 360 
AGCCCTTTTC TTTCATTCOC imTGCTT T CTGTGCCAAT CCCTTTGCTI CTGATTCCAT 420 
TATGGAAAAC GT7GATCGCA ACTTGAGGTT TTTATTTATA CTGTGCCTTG AAAGCTTGGA 480 
TACCrCTTCT TACACCAGAT ACCTTATTAA GT7TACGCCA GCTTGATGCT TTATTTTTTC 540 
CCTTTGAAOT AOIGACC ST T C TCr CG TTTT TTTCCTTTCA AACTGGTGAG GCTTAGATTT 600 
TTCT A ATCGG ATTTTTTAOC TCATGATCTA GTTGCATACC CAAATGCTTG TAAATCTTTT 660 
CCTASITAAC A T37 TCAXAA CTTCGGATTT ACATGTTGTA TATACTTGTC ATCTCTGTTT 720 
CTAGTAAAAA TATATGGCAT TTATAGAAAT ACGTAATTCC TQATTTCCTT TTTTTTTATC 780
TCTATCCTCT CT5TCTACAG CTCAAACAGA CTTCAC7CCT ATTTTTATTT ATACAATTTT 840 
ATATGCACTC T Gf CCTTGGT TCTT G T G TT G TAASGATACA GCCTTAAATT TCCTACAGCC 900 
ATQCTCACTA AS3CCGGTTG TCACAJGSGT 7CAAATGTAA AACGGGCACC TTTOGCTGCT 960 
GCCTTCCOGA GATCCACGAC ACTAAACTGC TTCTCCACTG AGGTATAAAT CGCTTCAGAT 1020 
CCCAGGGAAG TGCAGATCCA CGTGCATATT CTTAAAGAAG AATGAATACT TTCTAAAATA 1080 
TTTTGGCATA GGAACCAAGC TGCATGGATT TGTTTGGGAC TTAAATTATT TTGGTAAOQC 1140 
AGTGCATACC. TTT7AAACAC AGTTCCACCA TGC7AACGAG TCACAGCGTT TATGCAGAAC 1200 
TGAJCCCTGG A73CCTGTTG CAGCTGTTTA CGGCACTGCC TTGCAGTGAG CATTGCAGAT 1260 
aggogtggcg to:tttgtgt OGTGTTCCCA CACGCTOCCA CACACCCACC TCCCCGAACA 1320 
CATCTCACCT GC79SGTACT TTTCAAACCA TCTTAGCAGT AGTAGATGAC TTACTATGAA 1380 
ACACAGAAGT TC~ CAOTTG GATATTCTCA TG33ATGTCT TTTTTCCCAT CTTGGGCAAA 1440 
OTATGATAAA GCATCTCT A T TTGTAAATTA TGCACTTGTT AGTTOC7GAA TCCTTTCTAT 1500 
AOCACCACTT A7TGCAGCAC GTGTAGGCTC TC 3T CT0 GC C TGT G TCTGT G CTTCAATCTT 1560 
TTAAAGCTTC TTTGGAAATA CACTGACTTC AT7CAAGTCT CTTGAAGATA. GTAAACAGTA 1620 
CTTACCTTTG A7CCCAATGA AATCGASCAT TTCAOTT G TA AAAGAATTCC GCCTATTCAT 1680 
ACCATGTAAT GTAATTTTAC ACCCCCAGTG CTGACACTTT GGAATATATT CAAGTAATAG 1740 
ACTTTGGCCT CAZCCTCTT G TGTACTGTAT TTT3TAATAG AAAATATT7T AAACTGTGCA 1800 
TATGATTATT ACATTATGAA AGAGACATTC TGCTGATCTT CAAATGTAAG AAAATGACGA 1860
GTOCGTGTGC T77TATAAAT ACAAGTGATT GCAAATTAGT GCAGGTCTCC TTAAAAAAAA 1920 
AAAAAAAAAG TAATATAAAA AGGACCAGCT OTTI JA CAAG TGAAATACAT TCCTATTTGG 1980 
TAAACAGTTA CATTTTTATG AAGAT7ACCA GCGCTCCTGA CTTTCTAAAC ATAAGGCTGT 2040 
ATTGTCTTCC TCTACCATTG CATTTCCTCA TTCCCAATTT GCACAAGGAT GTCTGGGTAA 2100 
ACTATTCAAG AAA7GCCTT? CAAATACAGC ATGGGAGCTT GTCTGACTTG GAATGC AGAG 2160 
TTGCACTGCA AAATGTCAGG AAATC3ATGT CTCTCAGAAT GCCCAACTCC AAACGATTTT 2220 
AT A TG TG T A T ATAGTAAGCA 6TTTCCTGA T TCCAGCAGOC CAAAGAGTCT GCTGAATGTT 2280 
GTGTTGCCGG A3ACCTCTAT TTCTCAACAA GGTAAGATGO TATCCTASCA ACTGCGGATT 2340
TTAATACATT 7TCAGCACAA GTACTTAGTT AATCTC7ACC TTTAGGGATC O IUIATCAT 2400 
TTTTACATOT TATACTTGAA ATACTGCATA ACTTTTAGCT TTCATGGGTT CCTTTTTTTC 2460 
AGCCTTTAGG AGACTCTTAA GCAATTTGCT GTCCAACTTT TGTGTTGGTC TTAA ACTGCA 2520 
ATAGTAGTTT ACCTTGTATT GAAGAAATAA AGACCATTTT TATATTAAAA AATACTTTTO 2580
TCTCTCTTCA TT7TGAC7TG TCTSATATCC TTGCAGTGCC CATTATGTCA mill E ICAC 2640
ATATTCAGAC ATCAAAACTT AACGTGAGCT CAGTGGAGTT ACAGCTGCGG TTTTGATCCT 2700 
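
The rows of the listing above follow a regular layout (six blocks of ten bases followed by a running base count), so extraction damage can be flagged mechanically. The following is a minimal sketch of such a check, not part of the original application; the function name `check_row` and the assumption that every row holds exactly 60 bases are illustrative only.

```python
VALID_BASES = set("ACGT")

def check_row(row, per_row=60):
    """Return (running_count, problems) for one sequence-listing row.

    Assumes (hypothetically) that a clean row is six 10-base blocks
    followed by a running base count such as 60, 120, 180.
    """
    tokens = row.split()
    # Treat the last token as the running count only if it is purely numeric.
    label = tokens[-1] if tokens and tokens[-1].isdigit() else None
    blocks = tokens[:-1] if label is not None else tokens
    bases = "".join(blocks).upper()

    problems = []
    bad = sorted({ch for ch in bases if ch not in VALID_BASES})
    if bad:
        problems.append("non-ACGT characters: " + "".join(bad))
    if len(bases) != per_row:
        problems.append(f"expected {per_row} bases, found {len(bases)}")
    if label is not None and int(label) % per_row != 0:
        problems.append(f"running count {label} is not a multiple of {per_row}")
    return label, problems


clean = "TGCCGCCTTC TTTGATATTC ACTCTGTTGT ATTTCATCTC TTCTTCCCGA TCAAACGATA 60"
damaged = "TAACAGTCTG TATAACAGTC TGTGAGGAAA TAC7TGGTAT TTCTTCTCAT CACTGTTTTT 120"
print(check_row(clean))
print(check_row(damaged))
```

A row that fails the check should be compared against the filed sequence listing; the check cannot recover the intended base.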



(57) Abstract: The present invention relates to novel methods of producing transgenic
avians, preferably chickens, wherein the incorporated transgene may be expressed as a
constituent protein of the white of a hard-shell egg. The present invention provides
sperm-mediated transfer for the introduction to an avian egg of a transgene encoding a
heterologous polypeptide. The avian sperm may be irradiated before the transgene is
incorporated therein. Transgenes may be incorporated into avian sperm by lipofection,
electroporation, restriction enzyme mediated integration (REMI) or similar methods. The
modified avian sperm may then be delivered to an avian oocyte by microinjection,
intracytoplasmic sperm injection (ICSI) or artificial insemination, or by natural coitus
after the modified avian sperm are returned to a male bird. Heterologous nucleic acid may
be integrated directly into the genomic nucleic acid of the oocyte, or by first integrating
the heterologous nucleic acid into the nucleic acid of a male germ cell and subsequently
delivering the transgenic male germ cell to an oocyte. Alternatively, the heterologous
nucleic acid may be an episome within the sperm, or within the derivative zygote formed by
the fusion of the sperm and the recipient oocyte, and may replicate independently of the
zygote genome. Co-segregation of the episome with the replicated oocyte genome into all of
the daughter cells may be induced by the heterologous nucleic acid having a centromeric
body derived from, for example, a chromosome of a chicken.
PRODUCTION OF TRANSGENIC AVIANS USING SPERM-MEDIATED TRANSFECTION

This application claims the benefit of United States Provisional Patent Application
Serial No. 60/323,961, filed September 21, 2001, and United States Provisional Patent
Application Serial No. 60/324,001, filed September 21, 2001, both of which are
incorporated by reference herein in their entireties.

1. FIELD OF THE INVENTION 

The present invention relates to methods of producing a transgenic avian by
introducing a nucleic acid encoding a heterologous protein into the genome of an avian
oocyte by sperm-mediated transfection. The present invention further relates generally to a
transgenic avian capable of expressing a heterologous polypeptide, which is preferably
deposited into the white of an avian egg, said avian generated by sperm-mediated
transgenesis. The invention further provides vectors containing coding sequences for
heterologous proteins, the expression of which is under the control of a promoter and other
regulatory elements that cause expression of the heterologous protein and, preferably, lead
to deposition of the protein in the avian egg. Also included in the invention are avian eggs
derived from the transgenic avians and the heterologous proteins isolated therefrom.


2. BACKGROUND 

The field of transgenics was initially developed to understand the action of a single
gene in the context of the whole animal and the phenomena of gene activation, expression,
and interaction. The technology has also been used to produce models for various diseases
in humans and other animals and is amongst the most powerful tools available for the study
of genetics and the understanding of genetic mechanisms and function. From an economic
perspective, however, the use of transgenic technology for the production of specific
proteins or other substances of pharmaceutical interest offers significant advantages over
more conventional methods of protein production by gene expression. (Gordon et al., 1987,
Biotechnology 5: 1183-1187; Wilmut et al., 1990, Theriogenology 33: 113-123).

In particular, the production of monoclonal antibodies by traditional methods is
labor-intensive and costly. The purification of monoclonal antibodies from serum is a slow
and low-yielding process. The use of hybridoma cell lines, which were developed by fusing
a B lymphocyte with a myeloma cell, to propagate indefinitely in vivo as ascites or in vitro
in tissue culture requires major expenditures in tissue culture facilities or mouse breeding.
(Kohler and Milstein, 1975, Nature 256: 495-497). Although various strategies have been
proposed to overcome the deficiencies in antibody yield (e.g., engineering single-chain
antibodies (scAb) comprising immunoglobulin heavy and light chain variable regions), no
method has proven entirely satisfactory in elevating antibody yields to the levels
desired for adequate commercial production.

The industry has been experimenting with transgenic animals that can express, for
example, an exogenous protein such as an antibody under conditions that offer high yield of
the protein in an active form while incorporating post-translational modifications, such as
glycosylation, typically required for full functionality of the antibody. In this context,
heterologous nucleic acids have been engineered so that an expressed protein may be joined
to a protein or peptide that will allow secretion of the transgenic expression product into
milk or urine, from which the protein may then be recovered. These procedures have had
limited success, however, and may require lactating animals, with the attendant costs of
maintaining individual animals or herds of large species, including cows, sheep or goats.


Avian Transgenics 

One transgenic system that holds potential is the avian reproductive system. The
exogenous protein can be produced in the white of an avian egg from which it may be
readily purified. (MacArthur, PCT Publication WO 97/47739). The production of an avian
egg begins with formation of a large yolk in the ovary of the hen. The unfertilized oocyte or
ovum is positioned on top of the yolk sac. After ovulation, the ovum passes into the
infundibulum of the oviduct where it is fertilized, if sperm are present, and then moves into
the magnum of the oviduct, lined with tubular gland cells. These cells secrete the egg-white
proteins, including ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin, and
ovomucin, into the lumen of the magnum where they are deposited onto the avian embryo
and yolk.

The hen oviduct, for example, can serve as an excellent protein bioreactor because
of the high levels of protein production, the promise of proper folding and
post-translational modification of the target protein, the ease of product recovery, and
the shorter developmental period of chickens compared to other potential animal species. The
economic advantage of breeding flocks of transgenic birds laying eggs expressing
exogenous proteins would be significant when compared to more traditional animals, such
as cows, sheep or goats, producing heterologous protein in milk. What is needed, however,
is an efficient method of introducing a heterologous nucleic acid into a recipient avian
embryonic cell.
Vectors

Genetic information has been transferred to avian embryos using vectors.
Bosselman et al., in U.S. Patent No. 5,162,215, describe a method for introducing a
replication-defective retroviral vector into a pluripotent stem cell of an unincubated chick
embryo, and further describe chimeric chickens whose cells express a heterologous vector
nucleic acid sequence. However, the percentage of G1 transgenic offspring (progeny from
vector-positive male G0 birds) was low and varied between 1% and approximately 8%.
In addition, the use of viral vectors poses limitations, including limitations on transgene
size and potential viral infection of the offspring, thus posing significant regulatory
issues for production of biologics.

Similarly, Jaenisch reported that while retroviral vectors did transfer genetic
information to embryos, the resulting animals were mosaics with gene insertions at various
loci in the genomic nucleic acid. (1976, Proc. Natl. Acad. Sci. USA 73: 1260-1264). The
transgenes were also differentially expressed in the different tissues of each animal.
(Jaenisch, 1980, Cell 19: 181-188).

Nuclear Transfer 

Nuclear transfer from cultured cell populations is another route to produce
transgenics, wherein donor cells may be sexed, optionally genetically modified, and then
selected in culture before their use. The resultant transgenic animal originates from a single
transgenic nucleus and therefore, mosaics are avoided. Nuclear transfer from cultured
somatic cells also provides a route for directed genetic manipulation of animal species,
including the addition or "knock-in" of genes, and the removal or inactivation or
"knock-out" of genes or their associated control sequences (Polejaeva et al., 2000,
Theriogenology 53: 117-26).

Two types of recipient cells are commonly used in nuclear transfer procedures:
oocytes arrested at the metaphase of the second meiotic division (MII), which have a
metaphase plate with the chromosomes arranged on the meiotic spindle, and pronuclear
zygotes. In agricultural mammals, however, development does not always occur when
pronuclear zygotes are used, and, therefore, MII-arrested oocytes are the preferred recipient
cells. Enucleated two-cell stage blastomeres of mice have also been used as recipients.

After enucleation and introduction of donor genetic material, the reconstructed
embryo is cultured to the morula or blastocyst stage, and transferred to a recipient animal,
either in vitro or in vivo, and developed to term. (Eyestone and Campbell, 1999, J. Reprod.
Fertil. Suppl. 54: 489-97). Double nuclear transfer has been reported in which an activated,
previously transferred nucleus is removed from the host unfertilized egg and transferred
again into an enucleated fertilized embryo. Activation (initiation of development) is most
often induced chemically. Cultured cells can also be frozen and stored indefinitely for
future use.

Although gene targeting techniques combined with nuclear transfer hold tremendous
promise for nutritional and medical applications, current approaches suffer from several
limitations, including long generation times between the founder animal and production
transgenic herds, and extensive husbandry and veterinary costs. It is therefore desirable to
use a system where cultured somatic cells for nuclear transfer are more efficiently
employed.

Sperm-Mediated Transfection Mechanism 

A promising method for producing transgenic animals is the stable transfection of
male germ cells in vitro and their transfer to a recipient oocyte. PCT Publication WO
87/05325 discloses a method of transferring organic and/or inorganic material into sperm or
egg cells by using liposomes. Bachiller et al. used Lipofectin-based liposomes to transfer
DNA into mouse sperm, and provided evidence that the liposome-transfected DNA was
overwhelmingly contained within the sperm's nucleus. (1991, Mol. Reprod. Develop. 30:
194-200). However, no transgenic mice could be produced by this technique.

Similarly, Nakanishi and Iritani used Lipofectin-based liposomes to associate
heterologous DNA with chicken sperm, which were in turn used to artificially inseminate
hens. (1993, Mol. Reprod. Develop. 36: 258-261). Although the heterologous DNA was
detectable in many of the resultant fertilized eggs, there was no evidence of genomic
integration of the heterologous DNA either in the DNA-liposome treated sperm or in the
resultant chicks.

Heterologous DNA may also be transferred into sperm cells by a process called
electroporation that creates temporary, short-lived pores in the cell membrane of living cells
by exposing them to a sequence of brief electrical pulses of high field strength. The pores
allow cells to take up heterologous material such as DNA, while only slightly compromising
cell viability. Gagne et al. disclose the use of electroporation to introduce heterologous
DNA into bovine sperm subsequently used to fertilize ova. (1991, Mol. Reprod. Develop.
29: 6-15). However, there was no evidence of integration of the electroporated DNA either
in the sperm nucleus or in the nucleus of the egg subsequent to fertilization by the sperm.

Yet another method, initially developed for integrating heterologous DNA into yeasts
and slime molds and later adapted to avian sperm, is restriction enzyme mediated
integration (REMI), which utilizes a linear DNA derived from a plasmid DNA by cutting
that plasmid with a restriction enzyme that generates single-stranded cohesive ends.
(Shemesh et al., PCT International Publication WO 99/42569). The linear, cohesive-ended
DNA together with the restriction enzyme used to produce the cohesive ends is then
introduced into the target cells by electroporation or liposome transfection. The restriction
enzyme is then thought to cut the genomic DNA at sites that enable the heterologous DNA
to integrate via its matching cohesive ends. (Schiestl and Petes, 1991, Proc. Natl. Acad.
Sci. USA 88: 7585-7589). Although Shemesh described transgenic birds that were resistant
to Infectious Bursal Disease, there was no evidence of expression of a heterologous protein
in the oviduct for deposition into egg whites.
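
The cohesive-end matching that REMI is thought to rely on can be illustrated with a small string model. This is a sketch only: the enzyme (EcoRI, site GAATTC, cut after the first G) is an assumption chosen for illustration and is not named in the text, and the helper names `digest`, `overhang`, `revcomp` and `compatible` are hypothetical.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

def revcomp(seq):
    """Reverse complement of a DNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))

def digest(seq, site="GAATTC", cut_offset=1):
    """Split the top strand at every occurrence of `site`.

    `cut_offset` is the number of site bases left on the upstream fragment
    (1 for the assumed EcoRI-style cut G^AATTC).
    """
    seq = seq.upper()
    fragments, start = [], 0
    pos = seq.find(site)
    while pos != -1:
        fragments.append(seq[start:pos + cut_offset])
        start = pos + cut_offset
        pos = seq.find(site, pos + 1)
    fragments.append(seq[start:])
    return fragments

def overhang(site="GAATTC", cut_offset=1):
    """Single-stranded 5' overhang left by a symmetric cut in a palindromic site."""
    return site[cut_offset:len(site) - cut_offset]

def compatible(end_a, end_b):
    """Two cohesive ends can anneal if one is the reverse complement of the other."""
    return revcomp(end_a) == end_b.upper()


plasmid_end = overhang()   # end of the linearized plasmid DNA
genomic_end = overhang()   # end produced when the same enzyme cuts genomic DNA
print(digest("AAAGAATTCTTTGAATTCGGG"))
print(plasmid_end, compatible(plasmid_end, genomic_end))
```

Because the same enzyme cuts both the plasmid and the genomic DNA, both ends carry the same palindromic overhang and can anneal; an end produced by a different enzyme generally cannot.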

What is needed, therefore, is an efficient method of generating a transgenic avian 
capable of expressing a heterologous protein coded by a transgene, particularly in the 
oviduct for deposition into egg whites. 

3. SUMMARY OF THE INVENTION

The invention provides methods for the stable introduction by sperm-mediated
transfection of heterologous coding sequences into the genome of an avian, preferably a
chicken, and expressing those heterologous coding sequences to produce desired proteins
and/or to alter the phenotype of the transgenic avian. Synthetic vectors and gene promoters
useful in the methods are also provided by the present invention, as are transgenic avians
that express a heterologous protein and avian eggs, preferably chicken eggs, containing a
heterologous protein. In a preferred embodiment, the vectors useful in methods of the
invention are not eukaryotic viral, more preferably not retroviral, vectors (although the
vectors may contain transcriptional regulatory elements, such as promoters, from eukaryotic
viruses). In other embodiments, however, the vectors are retroviral vectors.

One aspect of the present invention is a method of producing a transgenic avian,
preferably a chicken, by introducing in an avian oocyte at least one transgene encoding at
least one heterologous polypeptide by sperm-mediated transfection. The method comprises
first, isolating an avian sperm, second, incorporating a transgene into the avian sperm, and
third, delivering the modified avian sperm to an avian oocyte. In one embodiment, the
avian sperm is irradiated with gamma rays before the transgene is incorporated therein.

In one embodiment, the transgene is injected directly into the testis of a male avian 
and incorporated in the avian sperm. The modified sperm is then delivered to the avian 
oocyte by mating the male avian with a wild type or transgenic female avian. 

In another embodiment, the transgene is incorporated in the avian sperm in vitro by
lipofection, electroporation, restriction enzyme mediated integration (REMI) or similar
methods. In a preferred embodiment, the modified avian sperm is then delivered to the
avian oocyte by natural coitus after the modified avian sperm are returned to the testis of a
male avian. In another preferred embodiment, the modified avian sperm is delivered to the
avian oocyte by microinjection (e.g., intracytoplasmic sperm injection (ICSI)) or standard
artificial fertilization methods. The resulting transgenic embryo can then be transferred to
the oviduct of a recipient hen for development and to be laid as a shelled egg (or,
alternatively, cultured ex vivo). The shelled egg is incubated to hatch a transgenic avian that
has incorporated, preferably integrated into its genome, the selected nucleic acid. In
preferred embodiments, the avian sperm is irradiated before the transgene is incorporated
therein.

In certain embodiments, a transgene comprising a heterologous nucleic acid may be
integrated directly into the genomic nucleic acid of an avian sperm and subsequently
delivered to an avian oocyte. When the heterologous nucleic acid is directly integrated into
the genome of the avian sperm which then fertilizes an avian oocyte, the resulting
transgenic embryo will include the transgenic heterologous nucleic acid in all of its cells. In
preferred embodiments, the transgenic heterologous nucleic acid is incorporated into at least
one embryonic cell, preferably the germinal disk of an early stage embryo, that then develops
into a transgenic avian.

Alternatively, the heterologous nucleic acid may be an episome within the modified
avian sperm, or within the derivative zygote formed by the fusion of the modified avian
sperm and the avian oocyte. The episome may replicate independently of the zygote
genome. When the heterologous nucleic acid is episomal with respect to the genome of the
transgenic zygote, and the episomal nucleic acid has a centromeric body, most, if not all, of
the cells of the transgenic embryo will include the heterologous nucleic acid. Accordingly,
in preferred embodiments, the transgene further comprises centromere and/or telomere
sequences of an avian chromosome.

The invention further provides a method for incorporating at least one transgene into
the genome of a spermatozoon cell or a precursor thereof isolated from a donor male avian,
and returning the modified cell to the testis of a recipient male avian, preferably the donor
male avian, so that a genetically modified male gamete is produced by the male avian.
Breeding the male avian with a female of its species will generate a transgenic progeny
carrying the at least one transgene in its genome.

The invention also provides methods for introducing a heterologous nucleic acid to
an avian oocyte in addition to those described in United States Application Serial No.
09/877,374, filed June 8, 2001, entitled "Production of Monoclonal Antibody By a
Transgenic Chicken", by Jeffrey C. Rapp; and United States Application Serial No.
________, filed September 18, 2002, entitled "Production of a Transgenic Avian By
Cytoplasmic Injection", by Jeffrey C. Rapp and Leandro Christmann, both of which are
incorporated by reference herein in their entireties. In certain embodiments, the avian
oocyte is removed from the ovaries of a donor female avian to facilitate in vitro fertilization
by the modified avian sperm of the invention. In certain other embodiments, the modified
avian sperm is delivered to an avian oocyte in vivo by natural coitus. The fertilized ovum is
then, preferably, returned to or maintained in the oviduct of the donor female avian or a
surrogate female avian to be laid as a hard-shell egg or, as an alternative, cultured ex vivo.
The hard-shell egg is incubated and hatched, producing a transgenic chick that expresses a
heterologous protein and/or that can be bred to generate a line of transgenic avians
expressing a heterologous protein.

Preferably, the avian sperm or the reproductive system of a male avian, preferably
the seminiferous tubules and/or site of sperm production, development, and/or storage in the
testis, is irradiated by gamma rays before transgene incorporation. More preferably, the
transgene is integrated directly into the genome of the avian sperm. Most preferably, the
transgene further comprises centromere and/or telomere sequences.

In particular embodiments, the level of mosaicism of the transgene (percentage of
cells containing the transgene) in avians hatched from sperm-mediated transfected embryos
(i.e., the G0s) is greater than 5%, 10%, 25%, 50%, 75% or 90%, or is the equivalent of one
copy per one genome, two genomes, five genomes, seven genomes or eight genomes, as
determined by any number of techniques known in the art and described infra. In
additional particular embodiments, the percentage of G0s that transmit the transgene to
progeny (G1s) is greater than 5%, preferably, greater than 10%, 20%, 30%, 40%, and, most
preferably, greater than 50%.
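
The two ways of stating mosaicism above (percentage of cells carrying the transgene, and copies per number of genomes) can be related by simple arithmetic. The sketch below assumes, purely for illustration, that each transgene-positive cell carries exactly one copy; under that assumption 50% mosaicism corresponds to one copy per two genomes. The function name `genomes_per_copy` is hypothetical, and the text does not itself state this conversion.

```python
def genomes_per_copy(mosaicism_percent):
    """Number of genomes per transgene copy at a given mosaicism level.

    Assumes (hypothetically) one transgene copy per transgene-positive cell.
    """
    if not 0 < mosaicism_percent <= 100:
        raise ValueError("mosaicism must be in the range (0, 100]")
    return 100.0 / mosaicism_percent


# Mosaicism thresholds listed in the text.
for percent in (5, 10, 25, 50, 75, 90):
    print(f"{percent}% mosaicism: one copy per {genomes_per_copy(percent):.2f} genomes")
```

Under a different copy-number assumption (for example, tandem insertions in each positive cell) the conversion would change accordingly.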

In certain other embodiments, the level of transgenics that result from mating avians
hatched from sperm-mediated transfected embryos (i.e., the G0s) with a wild type or
transgenic avian is greater than 5%, 10%, 25%, 50%, 75% or 90%.

In another embodiment, the present invention provides methods for producing
heterologous proteins in avians. Transgenes are introduced by sperm-mediated transfection
into the genome of an avian oocyte which becomes fertilized and then develops into a
transgenic avian. The heterologous protein(s) of interest may be expressed in the tubular
gland cells of the magnum of the oviduct, secreted into the lumen, and most preferably,
deposited within the egg white onto the egg yolk or expressed, for example, in the serum of
the avian. In preferred embodiments, the level of expression of the heterologous protein in
the egg white of eggs laid by G0 and/or G1 chicks and/or their progeny is greater than 5 µg,
10 µg, 50 µg, 100 µg, 250 µg, 500 µg or 750 µg, more preferably greater than 1 mg, 2 mg, 5
mg, 10 mg, 20 mg, 50 mg, 100 mg, 200 mg, 500 mg, 700 mg, 1 gram, 2 grams, 3 grams, 4
grams or 5 grams.

The transgenic avians can also be bred to identify those avians that carry the
transgene in their germ line. The exogenous gene coding for the heterologous proteins can
therefore be transmitted by sperm-mediated transfection of the exogenous gene into the
avian oocytes, and by subsequent stable transmission of the exogenous gene to the avian's
offspring in a Mendelian fashion. More information on Mendelian inheritance can be found
in Hartl and Jones, 2001, Genetics: Analysis of Genes and Genomes, 5th ed., Jones &
Bartlett Publishers, Inc., the content of which is incorporated by reference herein in its
entirety.

Another aspect of the invention provides for the isolation of heterologous proteins in
transgenic avians and the use thereof in pharmaceutical products including but not limited
to vaccines, biologics and, particularly, therapeutically or diagnostically useful antibodies.
The expressed heterologous protein(s) of interest may be collected and processed using
standard techniques from the avian eggs, preferably the egg white, the serum, or other
tissues from the transgenic avian.

The present invention further provides methods for producing a heterologous protein
in an avian oviduct. The method comprises, as a first step, providing a vector containing a
coding sequence and a promoter that functions in avians, preferably in the avian magnum,
operably linked to the coding sequence, so that the promoter can effect expression of the
nucleic acid in the tubular gland cells of the magnum of an avian oviduct and/or in any other
desired tissue of the avian. In a preferred embodiment, the vector containing the transgene
is not a eukaryotic viral vector (preferably, not a retroviral vector, such as but not limited to
reticuloendotheliosis virus (REV), ALV or MMLV) or derived from a eukaryotic virus (but,
in certain embodiments, may contain promoter and/or other gene expression regulatory
sequences from a eukaryotic virus, such as, but not limited to, a cytomegalovirus promoter).
Next, the vector is introduced into avian sperm in vitro by lipofection, electroporation,
restriction enzyme mediated integration (REMI) or similar methods, or in vivo by directly
injecting into the testis, so that the vector sequence may be incorporated into the avian
sperm. In preferred embodiments, the avian sperm or precursor cells are irradiated by
gamma rays before the vector sequence is incorporated therein. In another preferred
embodiment, the vector sequence further comprises centromere and/or telomere sequences.
Then, the modified avian sperm are delivered to an avian oocyte by natural coitus or in vitro
by microinjection or artificial insemination to form a transgenic embryonic cell. In certain
embodiments, the recipient avian oocyte is wild type and unmodified or, preferably, modified
in a manner that facilitates the delivery of the transgene by the modified avian sperm. In
certain other embodiments, the recipient avian oocyte is derived from a first-generation or,
preferably, second-generation transgenic avian whose germ-line carries the transgene.
Finally, a mature transgenic avian that expresses the exogenous protein in its oviduct is
derived from the transgenic embryonic cell or by breeding a transgenic avian derived from
the transgenic embryonic cell.

The present invention further provides promoters useful for expression of the
heterologous protein in the egg. For example, the transgene may comprise regions of at
least two promoters derived from an avian including, but not limited to, an oviduct-specific
promoter such as an ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin, or
ovomucin promoter or any other promoter that directs expression of a gene in an avian,
particularly in a specific tissue of interest, such as the magnum, and a protamine promoter,
or a fragment thereof which is sufficient to drive the expression of a marker gene such as
Green Fluorescent Protein (GFP). Alternatively, the promoter used in the expression vector
may be derived from that of the lysozyme gene that is expressed in both the oviduct and
macrophages. In particular embodiments, the gene regulatory sequences are flanked by
matrix attachment regions (MARs), preferably, but not limited to those associated with the
lysozyme gene in chickens or other avians. The nucleic acid encoding the polypeptide may
be operably linked to a transcription promoter and/or a transcription terminator.

Other embodiments of the invention provide for transgenic avians, such as chickens
or quail, carrying a transgene in the genetic material of their germ-line tissue, preferably
where the transgene was not introduced into the avian genome using a eukaryotic viral
promoter. The transgene incorporated into the genomic DNA of a recipient avian can
encode at least one polypeptide that may be, for example, but is not limited to, a cytokine, a
growth factor, enzyme, structural protein, immunoglobulin, or any other polypeptide of
interest that is capable of being expressed by an avian cell or tissue. Preferably, the
heterologous protein is a mammalian, preferably a human, protein or derived from a
mammalian, or preferably a human, protein (e.g., a derivative or variant thereof). In
particular embodiments, the invention provides heterologous proteins isolated or purified
from an avian tissue, preferably serum, more preferably eggs, most preferably egg whites,
and pharmaceutical compositions comprising such heterologous proteins. In a more
preferred embodiment, the heterologous protein is an antibody that is human (including
antibodies produced from human immunoglobulin sequences in mice or in antibody
libraries or synthetically produced but having variable domain framework regions that are
the same as or homologous to human framework regions) or humanized.

The present invention further relates to nucleic acid vectors (preferably, not derived
from eukaryotic viruses, except, in certain embodiments, for eukaryotic viral promoters and/or
enhancers) and transgenes inserted therein that incorporate multiple polypeptide-encoding
regions, wherein a first polypeptide-encoding region is operatively linked to a
transcription promoter and a second polypeptide-encoding region is operatively linked to an
Internal Ribosome Entry Sequence (IRES). For example, the vector may contain coding
sequences for two different heterologous proteins (e.g., the heavy and light chains of an
immunoglobulin) or the coding sequences for all or a significant part of the genomic
sequence for the gene from which the promoter driving expression of the transgene is
derived, and the heterologous protein desired to be expressed (e.g., a construct containing
the genomic coding sequences, including introns, of the avian lysozyme gene when the avian
lysozyme promoter is used to drive expression of the transgene, an IRES, and the coding
sequence for the heterologous protein desired to be expressed downstream (i.e., 3' on the
RNA transcript) of the IRES). Thus, in certain embodiments, the nucleic acid encoding the
heterologous protein is introduced into the 5' untranslated or 3' untranslated regions of an
endogenous gene, such as but not limited to, ovalbumin, lysozyme, ovomucoid,
ovotransferrin, conalbumin, and ovomucin, with an IRES sequence directing translation of
the heterologous sequence.
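
By way of illustration only, the element order of such a bicistronic construct can be modeled in a few lines of Python. This is a minimal sketch, not part of the disclosed methods: the `Element` class, the element labels, and the helper `is_valid_bicistronic` are hypothetical names introduced here, and the check encodes only the ordering stated above (a promoter linked to the first coding region, with the IRES immediately 5' of the second coding region).

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Element:
    kind: str  # one of "promoter", "cds", "ires" (labels assumed for this sketch)
    name: str


def is_valid_bicistronic(cassette: List[Element]) -> bool:
    """Return True if the 5'->3' order is promoter, CDS, IRES, CDS."""
    kinds = [element.kind for element in cassette]
    return kinds == ["promoter", "cds", "ires", "cds"]


# Example layout taken from the lysozyme illustration in the text above.
cassette = [
    Element("promoter", "avian lysozyme promoter"),
    Element("cds", "lysozyme genomic coding sequence"),
    Element("ires", "IRES"),
    Element("cds", "heterologous protein coding sequence"),
]
```

A cassette in which the IRES does not sit between the two coding regions is rejected by the check, which mirrors the requirement that the heterologous coding sequence lie 3' of the IRES on the transcript.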

Such nucleic acid constructs, when inserted into the genome of an avian and
expressed therein, will generate individual polypeptides that may be post-translationally
modified, for example, glycosylated or, in certain embodiments, form complexes, such as
heterodimers with each other in the white of the avian egg. Alternatively, the expressed
polypeptides may be isolated from an avian egg and combined in vitro, or expressed in a
non-reproductive tissue such as serum. In other embodiments, for example, but not limited
to, when expression of both heavy and light chains of an antibody is desired, two separate
constructs, each containing a coding sequence for one of the heterologous proteins operably
linked to a promoter (either the same or different promoters), are introduced into embryonic
cells by sperm-mediated transfection, and transgenic avians that harbor both
transgenes in their genomes and express both heterologous proteins are identified.
Alternatively, two transgenic avians each containing one of the two heterologous proteins






WO 03/024199 



PCT/US02/30156 



(e.g., one transgenic avian having a transgene encoding the light chain of an antibody and a 
second transgenic avian having a transgene encoding the heavy chain of the antibody) can 
be bred by Mendelian genetics to obtain an avian containing both transgenes in its germline 
and expressing both transgene-encoded proteins, preferably in eggs. (See Hartl and Jones,
2001, Genetics: Analysis of Genes and Genomes, 5th ed., Jones & Bartlett Publishers, Inc.,
the content of which is incorporated by reference herein in its entirety).
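
The expected outcome of such a cross can be sketched as follows. This is an illustrative calculation under stated assumptions, not a result from the specification: each parent is assumed to be hemizygous for a single-copy transgene, the two insertions are assumed to be unlinked, and the allele labels "H", "L" and "-" and both function names are hypothetical.

```python
from fractions import Fraction
from itertools import product

# Each parent is described by two loci: (heavy-chain locus, light-chain locus).
# "H" / "L" mark a transgene copy and "-" marks its absence (assumed labels).
HEAVY_PARENT = (("H", "-"), ("-", "-"))  # hemizygous for the heavy-chain transgene
LIGHT_PARENT = (("-", "-"), ("L", "-"))  # hemizygous for the light-chain transgene


def gametes(parent):
    """List the four equally likely gametes (one allele per locus)."""
    heavy_locus, light_locus = parent
    return list(product(heavy_locus, light_locus))


def fraction_with_both(parent_a, parent_b):
    """Fraction of offspring inheriting both the H and the L transgene."""
    pairings = list(product(gametes(parent_a), gametes(parent_b)))
    both = sum(
        1 for ga, gb in pairings if "H" in ga + gb and "L" in ga + gb
    )
    return Fraction(both, len(pairings))
```

Under these assumptions one quarter of the offspring are expected to carry both transgenes; linked insertions or multi-copy integrations would change that figure.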

For convenience, certain terms employed in the specification, examples, and 
appended claims are collected here. 

Additional objects and aspects of the present invention will become more apparent
upon review of the detailed description set forth below when taken in conjunction with the
accompanying figures, which are briefly described as follows.

3.1 DEFINITIONS 

The term "animal" as used herein refers to all vertebrate animals, including birds. It
also includes an individual animal in all stages of development, including embryonic and
fetal stages.

The term "avian" as used herein refers to any species, subspecies or race of organism
of the taxonomic class Aves, such as, but not limited to, chicken, quail, turkey, duck, goose,
pheasants, parrots, finches, hawks, crows and ratites including ostrich, emu and cassowary.
The term includes the various known strains of Gallus gallus, or chickens (for example,
White Leghorn, Brown Leghorn, Barred-Rock, Sussex, New Hampshire, Rhode Island,
Australorp, Minorca, Amrox, California Gray, Italian Partridge-colored), as well as strains
of turkeys, pheasants, quails, duck, ostriches and other poultry commonly bred in
commercial quantities.

The term "male germ cells" as used herein refers to sperm, sperm cells, spermatozoa
(i.e., male gametes) and developmental precursors thereof. Male germ cells with the
capacity to swim and transfer nucleic acid to an ovum are herein referred to as "viable male
germ cells." In fetal development, primordial germ cells are thought to arise from the
embryonic ectoderm, and are first seen in the epithelium of the endodermal yolk sac at the
E8 stage. From there they migrate through the hindgut endoderm to the genital ridges. In
the sexually mature male vertebrate animal, there are several types of cells that are
precursors of spermatozoa, and which can be genetically modified, including the primitive
spermatogonial stem cells, known as A0/As, which differentiate into type B spermatogonia.
The latter further differentiate to form primary spermatocytes, and enter a prolonged meiotic
prophase during which homologous chromosomes pair and recombine. Useful precursor
cells at several morphological/developmental stages are also distinguishable: preleptotene
spermatocytes, leptotene spermatocytes, zygotene spermatocytes, pachytene spermatocytes,
secondary spermatocytes, and the haploid spermatids. The latter undergo further
morphological changes during spermatogenesis, including the reshaping of their nucleus,
the formation of the acrosome, and assembly of the tail. The final changes in the spermatozoon
(i.e., male gamete) take place in the genital tract of the female, prior to fertilization.

The terms "ovum" and "oocyte" are used interchangeably herein. Although only
one ovum matures at a time, an animal is born with a finite number of ova. In avian
species, such as a chicken, ovulation, which is the shedding of an egg from the ovarian
follicle, occurs when the brain's pituitary gland releases luteinizing hormone (LH). Mature
follicles form a stalk or pedicel of connective tissue and smooth muscle. Immediately after
ovulation the follicle becomes a thin-walled sac, the post-ovulatory follicle. The mature
ovum erupts from its sac and starts its journey through the oviduct. Eventually, the ovum
enters the infundibulum where fertilization occurs. Fertilization must take place within 15
minutes of ovulation, before the ovum becomes covered by albumen. During fertilization,
sperm (avians have polyspermic fertilization) penetrate the blastodisc. When the sperm
lodges within this germinal disk, an embryo begins to form as a "blastoderm" or "zygote."

The term "embryonic cells" as used herein refers to cells that are typically single cell
embryos, fertilized or unfertilized, or the equivalent thereof, and is meant to encompass
dividing embryos, such as two-cell, four-cell, or even later stages as described by
Eyal-Giladi and Kochav (1976, Dev. Biol. 49: 321-337) and ova 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 12,
14, 16, 18, or 20 hours after the preceding lay. The embryonic cells may be isolated freshly,
maintained in culture, or reside within an embryo.

The term "fragment" as used herein refers to an at least 10, 20, 50, 75, 100, 150,
200, 250, 300, 500, 1000, 2000 or 5000 nucleotide long portion of a nucleic acid (e.g.,
cDNA) that has been constructed artificially (e.g., by chemical synthesis) or by cleaving a
natural product into multiple pieces, using restriction endonucleases or mechanical shearing,
or enzymatically, for example, by PCR or any other polymerizing technique known in the
art, or expressed in a host cell by recombinant nucleic acid technology known to one of skill
in the art. The term "fragment" as used herein may also refer to an at least 5, 10, 20, 30, 40,
50, 75, 100, 150, 200, 250, 300, 400, 500, 1000, 2000 or 5000 amino acid portion of a
polypeptide, which portion is cleaved from a naturally occurring polypeptide by proteolytic
cleavage by at least one protease, or is a portion of the naturally occurring polypeptide
synthesized by chemical methods or using recombinant DNA technology (e.g., expressed
from a portion of the nucleotide sequence encoding the naturally occurring polypeptide)
known to one of skill in the art.

The term "isolated nucleic acid" as used herein refers to a nucleic acid that has been
removed from other components of the cell containing the nucleic acid or from other
components of the chemical/synthetic reaction used to generate the nucleic acid. In specific
embodiments, the nucleic acid is 50%, 60%, 70%, 80%, 90%, 95%, 99% or 100% pure.
The "isolated nucleic acid" is neither (a) identical to that of any naturally occurring nucleic
acid nor (b) identical to that of any fragment of a naturally occurring genomic nucleic acid
spanning more than three separate genes, and includes DNA, RNA, or derivatives or
variants thereof. The term covers, for example, (a) a DNA which has the sequence of part
of a naturally occurring genomic molecule but is not flanked by at least one of the coding
sequences that flank that part of the molecule in the genome of the species in which it
naturally occurs; (b) a nucleic acid incorporated into a vector or into the genomic nucleic
acid of a prokaryote or eukaryote in a manner such that the resulting molecule is not
identical to any vector or naturally occurring genomic DNA; (c) a separate molecule such as
a cDNA, a genomic fragment, a fragment produced by polymerase chain reaction (PCR),
ligase chain reaction (LCR) or chemical synthesis, or a restriction fragment; (d) a
recombinant nucleotide sequence that is part of a hybrid gene, i.e., a gene encoding a fusion
protein; and (e) a recombinant nucleotide sequence that is part of a hybrid sequence that is
not naturally occurring. The techniques used to isolate and characterize the nucleic acids
and proteins of the present invention are well known to those of skill in the art and standard
molecular biology and biochemical manuals may be consulted to select suitable protocols
without undue experimentation. See, e.g., Sambrook et al., Molecular Cloning: A
Laboratory Manual, 3rd ed., Cold Spring Harbor Press (2001); the content of which is
herein incorporated by reference in its entirety.

By the use of the term "enriched" in reference to nucleic acid it is meant that the
specific DNA or RNA sequence constitutes a significantly higher fraction of the total DNA
or RNA present in the cells or solution of interest than in normal or diseased cells or in the
cells from which the sequence was taken. Enriched does not imply that there are no other
DNA or RNA sequences present, just that the relative amount of the sequence of interest
has been significantly increased, for example, by 1 fold, 2 fold, 5 fold, 10 fold, 50 fold, 100
fold, 500 fold, 1000 fold, 10,000 fold, 100,000 fold or 1,000,000 fold. The other DNA may,
for example, be derived from a yeast or bacterial genome, or a cloning vector, such as a
plasmid or a viral vector.
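
The comparison of fractions underlying the term "enriched" can be written out as a short calculation. The following is a sketch only: the function name `fold_enrichment` and its parameters are hypothetical, the amounts may be in any consistent unit (mass or copies), and the fold value is taken here as the ratio of the two fractions, which is one reasonable reading of the definition above.

```python
def fold_enrichment(target_in_sample: float, total_in_sample: float,
                    target_in_source: float, total_in_source: float) -> float:
    """Ratio of the target's fraction in the sample to its fraction in the source.

    All four amounts must use the same unit; the source is the cell population
    or solution from which the sequence was originally taken.
    """
    if total_in_sample <= 0 or total_in_source <= 0:
        raise ValueError("total amounts must be positive")
    if target_in_source <= 0:
        raise ValueError("target must be present in the source to compute a ratio")
    sample_fraction = target_in_sample / total_in_sample
    source_fraction = target_in_source / total_in_source
    return sample_fraction / source_fraction
```

For instance, a sequence making up 1 part in 1000 of the source nucleic acid and 1 part in 2 of the preparation corresponds to roughly a 500-fold enrichment by this measure.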

The term "transcription regulatory sequences" as used herein refers to nucleotide
sequences that are associated with a gene nucleic acid sequence and that regulate the
transcriptional expression of the gene. The "transcription regulatory sequences" may be
isolated and incorporated into a vector nucleic acid to enable regulated transcription in
appropriate cells of portions of the vector DNA. Exemplary transcription regulatory
sequences include enhancer elements, hormone response elements, steroid response
elements, negative regulatory elements, and the like. The "transcription
regulatory sequence" may precede, but is not limited to, the region of a nucleic acid
sequence that is in the region 5' of the end of a protein coding sequence that may be
transcribed into mRNA. Transcriptional regulatory sequences may also be located within a
protein coding region, in regions of a gene that are identified as "intron" regions, or may be
in regions of nucleic acid sequence that are in the region of nucleic acid.

The term "promoter" as used herein refers to the DNA sequence that determines the
site of transcription initiation by an RNA polymerase. A "promoter-proximal element" may
be a regulatory sequence within about 200 base pairs of the transcription start site. A
"magnum-specific" promoter, as used herein, is a promoter that is primarily or exclusively
active in the tubular gland cells of the avian magnum. Useful promoters also include
exogenously inducible promoters. These are promoters that can be "turned on" in response
to an exogenously supplied agent or stimulus, which is generally not an endogenous
metabolite or cytokine. Examples include an antibiotic-inducible promoter, such as a
tetracycline-inducible promoter, a heat-inducible promoter, a light-inducible promoter, or a
laser-inducible promoter. (See, e.g., Halloran et al., 2000, Development 127: 1953-1960;
Gerner et al., 2000, Int. J. Hyperthermia 16: 171-81; Rang and Will, 2000, Nucleic Acids
Res. 28: 1120-5; Hagihara et al., 1999, Cell Transplant. 8: 431-4; Huang et al., 1999, Mol.
Med. 5: 129-37; Forster et al., 1999, Nucleic Acids Res. 27: 708-10; Liu et al., 1998,
Biotechniques 24: 624-8, 630-2; the contents of which have been incorporated herein by
reference in their entireties).

To facilitate manipulation and handling of the nucleic acid to be administered, the
nucleic acid is preferably inserted into a cassette where it is operably linked to a promoter.
The promoter should be capable of driving expression in the desired cells. The selection of
appropriate promoters can be readily accomplished. For some applications, a high
expression promoter is preferred such as the cytomegalovirus (CMV) promoter. Other
promoters useful in the present invention include the Rous Sarcoma Virus (RSV) promoter
(Davis et al., 1993, Hum. Gene Therap. 4:151). In other embodiments, all or a portion of
the, for example, lysozyme, ovomucoid, albumin, conalbumin or ovotransferrin promoters,
which direct expression of proteins present in egg white, are used, as detailed infra, or
synthetic promoters such as the MDOT promoter described infra.

The term "expressed" or "expression" as used herein refers to the transcription from
a gene to give an RNA nucleic acid molecule complementary at least in part to a region of
one of the two nucleic acid strands of the gene. The term "expressed" or "expression" as
used herein also refers to the translation from said RNA nucleic acid molecule to give a
protein or polypeptide or a portion thereof.

The term "matrix attachment regions" as used herein refers to DNA sequences
having an affinity or intrinsic binding ability for the nuclear scaffold or matrix. The MAR
elements of the chicken lysozyme locus were described by Phi-Van et al., 1996, E.M.B.O. J.
76:665-664 and Phi-Van, L. and Strätling, W.H., 1996, Biochem. 35:10735-10742;
incorporated herein by reference in their entireties.

The term "nucleic acid vector" as used herein refers to a natural or synthetic single
or double stranded plasmid or viral nucleic acid molecule, or any other nucleic acid
molecule, such as but not limited to YACs, BACs, bacteriophage-derived artificial
chromosome (BBPAC), cosmid or P1 derived artificial chromosome (PAC), that can be
transfected or transformed into cells and replicate independently of, or within, the host cell
genome. A circular double stranded vector can be linearized by treatment with an
appropriate restriction enzyme based on the nucleotide sequence of the vector. A nucleic
acid can be inserted into a vector by cutting the vector with restriction enzymes and ligating
the pieces together. The nucleic acid molecule can be RNA or DNA.
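
Choosing an enzyme "based on the nucleotide sequence of the vector" amounts to finding a recognition site that occurs exactly once in the circular molecule. The sketch below illustrates that idea under simplifying assumptions: only the top strand is modeled, overhangs are ignored, the function name `linearize` and its parameters are hypothetical, and the example sequences are invented. EcoRI (recognition site GAATTC, cutting after the first G) is used purely as a familiar example.

```python
def linearize(circular_seq: str, site: str, cut_offset: int) -> str:
    """Cut a circular top-strand sequence once at a unique recognition site.

    cut_offset is the number of bases into the site at which the top strand
    is cut (1 for EcoRI, G^AATTC). Raises ValueError unless the site occurs
    exactly once on the circle, including across the origin.
    """
    n = len(circular_seq)
    # Append the first len(site) - 1 bases so sites spanning the origin are found.
    wrapped = circular_seq + circular_seq[: len(site) - 1]
    starts = [i for i in range(n) if wrapped[i : i + len(site)] == site]
    if len(starts) != 1:
        raise ValueError(f"expected exactly one {site} site, found {len(starts)}")
    cut = (starts[0] + cut_offset) % n
    # Rotate the circle so the linear molecule begins at the cut position.
    return circular_seq[cut:] + circular_seq[:cut]


# Invented 10-base "vectors" carrying a single EcoRI site.
linear_a = linearize("AAGAATTCTT", "GAATTC", 1)
linear_b = linearize("ATTCAAAAGA", "GAATTC", 1)  # site spans the origin
```

An enzyme whose site is absent or occurs more than once is rejected, since it would leave the vector uncut or cut it into several pieces.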

The term "expression vector" as used herein refers to a nucleic acid vector that
comprises regulatory sequences operably linked to a nucleotide sequence coding for at least
one polypeptide. As used herein, the term "regulatory sequences" includes promoters,
enhancers, and other elements that may control expression. Standard molecular biology
textbooks such as Sambrook et al. (supra) and Lodish et al., eds., "Molecular Cell Biology,"
Freeman (2000), incorporated herein by reference in their entireties, may be consulted to
design suitable expression vectors, promoters, and other expression control elements. It
should be recognized, however, that the choice of a suitable expression vector depends upon
multiple factors including the choice of the host cell to be transformed and/or the type of
protein to be expressed. Also useful for various applications are tissue-selective (i.e.,
tissue-specific) promoters, i.e., promoters from which expression occurs preferentially in cells of a
particular kind of tissue, compared to one or more other types of tissue. For example,
chicken oviduct-specific promoters naturally associated with the proteins of avian egg
whites including, but not limited to, lysozyme, ovomucoid, albumin, conalbumin, and
ovotransferrin may be used.

The term "recombinant cell" refers to a cell that has a new combination of nucleic
acid segments that are not covalently linked to each other in nature in that particular
configuration. A new combination of nucleic acid segments can be introduced into an
organism using a wide array of nucleic acid manipulation techniques available to those
skilled in the art. A recombinant cell can be a single eukaryotic cell, or a single prokaryotic
cell, or a mammalian cell. The recombinant cell may harbor a vector that is extragenomic.
An extragenomic nucleic acid vector does not insert into the cell's genome. A recombinant
cell can further harbor a vector or a portion thereof (e.g., the portion containing the
regulatory sequences and the coding sequence) that is intragenomic. The term
"intragenomic" defines a nucleic acid construct incorporated within the recombinant cell's
genome.

The terms "recombinant nucleic acid" and "recombinant DNA" as used herein refer
to a combination of at least two nucleic acid sequences that is not naturally found in a
eukaryotic or prokaryotic cell in that particular configuration. The nucleic acid sequences
may include, but are not limited to, nucleic acid vectors, gene expression regulatory
elements, origins of replication, sequences that when expressed confer antibiotic resistance,
and protein-encoding sequences. The term "recombinant polypeptide" is meant to include a
polypeptide produced by recombinant DNA techniques such that it is distinct from a
naturally occurring polypeptide either in its location, purity or structure. Generally, such a
recombinant polypeptide will be present in a cell in an amount different from that normally
observed in nature.

As used herein, the term "transgene" refers to a nucleic acid sequence (encoding, for
example, a human interferon polypeptide) that is partly or entirely heterologous, i.e.,
foreign, to the transgenic animal or cell into which it is introduced, or is homologous to an
endogenous gene of the transgenic animal or cell into which it is introduced, but which is
designed to be inserted, or is inserted, into the animal's genome in such a way as to alter the
genome of the cell into which it is inserted (e.g., it is inserted at a location which differs
from that of the natural gene or its insertion results in a knockout). A transgene also
includes a regulatory sequence designed to be inserted into the genome such that it regulates
the expression of an endogenous coding sequence, e.g., to increase expression and/or to
change the timing and/or tissue specificity of expression, etc. (e.g., to effect "gene
activation").

The term "transgenic animal" as used herein refers to an animal, including an avian
species such as a chicken, in which one or more of the cells of the animal contains
heterologous nucleic acid introduced by way of human intervention, such as by transgenic
techniques well known in the art or by the methods of the present invention.

As used herein, a "transgenic avian" is any avian species, including but not limited
to, chicken, turkey, duck, goose, quail, pheasants, parrots, finches, hawks, crows and ratites
including ostrich, emu and cassowary, in which one or more of the cells of the avian may
contain heterologous nucleic acid introduced by way of human intervention, such as by
transgenic techniques known in the art, and particularly, as described herein. The nucleic
acid is introduced into a cell, directly or indirectly by introduction into a precursor of the
cell, by way of deliberate genetic manipulation, such as by microinjection or by infection
with a recombinant virus. The term genetic manipulation does not include classical
cross-breeding or in vitro fertilization (although it does include fertilization with sperm into
which a transgene has been introduced), but rather is directed to the introduction of a
recombinant DNA molecule. This molecule may be integrated within a chromosome, or it
may be extrachromosomally replicating DNA. In the typical transgenic avian, the transgene
causes cells to express a recombinant form of the subject polypeptide, e.g., either agonistic
or antagonistic forms, or a form in which the gene has been disrupted.

The terms "chimeric animal" or "mosaic animal" are used herein to refer to animals
in which the recombinant gene is found, or in which the recombinant gene is expressed, in some
but not all cells of the animal. The term "tissue-specific chimeric animal" indicates that the
polypeptide-encoding gene is present and expressed in some tissues, but not others.

The term "knock-in animal" refers to an animal that carries a specific nucleic acid
sequence such as a "knock-in sequence" in a predetermined coding or noncoding region,
wherein the knock-in sequence is introduced through methods of recombination, such as
homologous recombination. The recombination event comprises replacing all or part of a
gene of the animal by a functional homologous gene or gene segment of another animal,
where the respective knock-in sequence is placed in the genomic sequence.

The term "chromosomal positional effect (CPE)" as used herein refers to the
variation in the degree of gene transcription as a function of the location of the transcribed
locus within the cell genome. Random transgenesis may result in a transgene being inserted
at different locations in the genome so that individual cells of a population of transgenic
cells may each have at least one transgene, each at a different location and therefore each in
a different genetic environment. Each cell, therefore, may express the transgene at a level
specific for that particular cell and dependent upon the immediate genetic environment of






the transgene. In a transgenic avian, as a consequence, different tissues may exhibit 
different levels of transgene expression. 

The term "cytokine" as used herein refers to any secreted polypeptide that affects the
functions of cells and is a molecule that modulates interactions between cells in the
immune, inflammatory or hematopoietic response. A cytokine includes, but is not limited
to, monokines and lymphokines regardless of which cells produce them. For instance, a
monokine is generally referred to as being produced and secreted by a mononuclear cell,
such as a macrophage and/or monocyte. Many other cells, however, also produce monokines,
such as natural killer cells, fibroblasts, basophils, neutrophils, endothelial cells, brain
astrocytes, bone marrow stromal cells, epidermal keratinocytes and B-lymphocytes.
Lymphokines are generally referred to as being produced by lymphocyte cells. Examples of
cytokines include, but are not limited to, Interleukin-1 (IL-1), Interleukin-6 (IL-6),
Interleukin-8 (IL-8), Tumor Necrosis Factor-alpha (TNF-alpha) and Tumor Necrosis
Factor-beta (TNF-beta).

The term "antibody" as used herein refers to polyclonal and monoclonal antibodies
and fragments thereof, and immunologic binding equivalents thereof. The term "antibody"
refers to a homogeneous molecular entity, or a mixture such as a polyclonal serum product
made up of a plurality of different molecular entities, and may further comprise any
modified or derivatised variant thereof that retains the ability to specifically bind an epitope.
A monoclonal antibody is capable of selectively binding to a target antigen or epitope.
Antibodies may include, but are not limited to, polyclonal antibodies, monoclonal antibodies
(mAbs), humanized or chimeric antibodies, single chain antibodies, Fab fragments, F(ab')2
fragments, disulfide-linked Fvs (sdFv), fragments produced by a Fab expression library,
anti-idiotypic (anti-Id) antibodies, intrabodies, synthetic antibodies, and epitope-binding
fragments of any of the above.

The term "immunoglobulin polypeptide" as used herein refers to a polypeptide
derived from a constituent polypeptide of an immunoglobulin. An "immunoglobulin
polypeptide" may be, but is not limited to, an immunoglobulin (preferably an antibody)
heavy or light chain and may include a variable region, a diversity region, joining region and
a constant region or any combination, variant or truncated form thereof. The term
"immunoglobulin polypeptide" further includes single-chain antibodies comprised of, but
not limited to, an immunoglobulin heavy chain variable region, an immunoglobulin light
chain variable region and optionally a peptide linker.

The term "origin of replication" (ori) as used herein refers to unique regions of a
nucleic acid containing multiple short repeated sequences, recognized by
multimeric origin of replication binding proteins that organize the assembly of multiple
enzymes and proteins required for the replication of the nucleic acid. The origin of
replication derived from E. coli may be included in a plasmid for replication of the plasmid
in a bacterial host. The SV40 viral ori is a 65 bp region derived from the SV40 viral
chromosome that when included in a nucleic acid sequence will allow replication of the
nucleic acid in an animal cell. Inclusion of the SV40 ori region in a plasmid that also has
the E. coli ori element will allow the plasmid to be replicated in both a bacterial host and in
an animal cell.

The term "centromere" as used herein refers to a small, specialized region of a
chromosome recognized as a constriction in a condensed chromosome. A kinetochore lies
within the centromeric region and is attached to microtubules extending to the poles of a
dividing cell.

The term "telomere" as used herein refers to repetitive oligomeric nucleic acid
sequences located at the ends of linear eukaryotic chromosomes. Telomeres are required to
prevent shortening of chromosomal DNA during replication of the linear nucleic acid.

Recombinant expression vectors can be designed for the expression of the encoded
proteins in eukaryotic cells. Useful vectors may comprise constitutive or inducible promoters
to direct expression of either fusion or non-fusion proteins. With fusion vectors, a number
of amino acids are usually added to the expressed target gene sequence such as, but not
limited to, a protein sequence for thioredoxin. A proteolytic cleavage site may further be
introduced at a site between the target recombinant protein and the fusion sequence.
Additionally, a region of amino acids, such as a polymeric histidine region, may be
introduced to allow binding of the fusion protein to metallic ions such as nickel bonded to a
solid support, and thereby allow purification of the fusion protein. Once the fusion protein
has been purified, the cleavage site allows the target recombinant protein to be separated
from the fusion sequence. Enzymes suitable for use in cleaving the proteolytic cleavage site
include, but are not limited to, Factor Xa and thrombin. Fusion expression vectors that may
be useful in the present invention include pGex (Amrad Corp., Melbourne, Australia),
pRIT5 (Pharmacia, Piscataway, NJ) and pMAL (New England Biolabs, Beverly, MA), that
fuse glutathione S-transferase, protein A, or maltose E binding protein, respectively, to the
target recombinant protein.

Expression of a foreign gene can be obtained using eukaryotic host cells such as, but
not limited to, mammalian or avian cells. The use of eukaryotic host cells permits partial or
complete post-translational modification such as, but not only, glycosylation and/or the
formation of the relevant inter- or intra-chain disulfide bonds. Examples of vectors useful
for expression in the chicken Gallus gallus include pYepSec1 as in Baldari et al.,
E.M.B.O. J. 6, 229-234 (1987) and pYES2 (Invitrogen Corp., San Diego, CA), incorporated
herein by reference in their entireties. Once the isolated DNA molecule of the present
invention has been cloned into an expression system, it is ready to be incorporated into a
host cell.

The terms "transformation" and "transfection" as used herein refer to the process of
inserting a nucleic acid into a cell. Many techniques are well known to those skilled in the
art to facilitate transformation or transfection of a nucleic acid into a prokaryotic or
eukaryotic organism. These methods involve a variety of techniques, such as treating the
cells with high concentrations of salt such as, but not only, a calcium or magnesium salt, an
electric field, detergent, or liposome mediated transfection, to render the host cell competent
for the uptake of the nucleic acid molecules, and by such methods as sperm-mediated and
restriction-mediated integration.

The term "transfecting agent" as used herein refers to a composition of matter added
to the genetic material for enhancing the uptake of heterologous DNA segment(s) into a
eukaryotic cell, preferably an avian cell, and more preferably a chicken male germ cell. The
enhancement is measured relative to the uptake in the absence of the transfecting agent.
Examples of transfecting agents include adenovirus-transferrin-polylysine-DNA complexes.
These complexes generally augment the uptake of DNA into the cell and reduce its
breakdown during its passage through the cytoplasm to the nucleus of the cell. These
complexes can be targeted to the male germ cells using specific ligands that are recognized
by receptors on the cell surface of the germ cell, such as the c-kit ligand or modifications
thereof.

Other preferred transfecting agents include, but are not limited to, lipofectin,
lipofectamine, DMRIE-C, Superfect, and Effectin (Qiagen), unifectin, maxifectin, DOTMA,
DOGS (Transfectam; dioctadecylamidoglycylspermine), DOPE (1,2-dioleoyl-sn-glycero-3-phosphoethanolamine),
DOTAP (1,2-dioleoyl-3-trimethylammonium propane), DDAB
(dimethyl dioctadecylammonium bromide), DHDEAB (N,N-di-n-hexadecyl-N,N-dihydroxyethyl
ammonium bromide), HDEAB (N-n-hexadecyl-N,N-dihydroxyethylammonium
bromide), polybrene, or poly(ethylenimine) (PEI). These
nonviral agents have the advantage that they can facilitate stable integration of xenogeneic
DNA sequences into the vertebrate genome, without size restrictions commonly associated
with virus-derived transfecting agents.

The terms "intracytoplasmic sperm injection" and "ICSI" as used herein refer to
delivering an exogenous nucleic acid to a recipient cell by associating the exogenous
nucleic acid with the head of a sperm cell and then delivering the sperm cell head to the
recipient cell by microinjection. The exogenous nucleic acid may be integrated into the
endogenous genomic nucleic acid of the sperm, non-integrated as an episomal element of
the nucleic acid complement of the sperm head, or linked internally or externally to the head
of the sperm. The terms "chICSI" and "CHICSI™" as used herein refer to intracytoplasmic
sperm injection into a chicken cell.

The terms "sub-zonal injection" and "SUZI" refer to delivering viable spermatozoa 
to an oocyte by microinjection, wherein the sperm are microinjected between the zona 
pellucida and the cytoplasmic membrane of an oocyte. 

The term "gene delivery (or transfection) mixture" as used herein, in the context of
the methods of sperm-mediated transfer described herein, refers to selected genetic material
in an appropriate vector mixed, for example, with an effective amount of lipid transfecting
agent, for example, a cationic or polycationic lipid, such as polybrene. The amount of each
component of the mixture is chosen so that the genetic modification, e.g., by transfection or
transduction, of a specific species of male germ cell is optimized. Such optimization
requires no more than routine experimentation. The ratio of DNA to lipid is broad,
preferably about 1:1, although other proportions can also be utilized depending on the type
of lipid transfecting agent used.

This application uses gene nomenclature accepted by the Cucurbit Genetics
Cooperative as it appears in the Cucurbit Genetics Cooperative Report 18:85 (1995), herein
incorporated by reference in its entirety. Using this gene nomenclature, genes are 
symbolized by italicized Roman letters. If a mutant gene is recessive to the normal type, 
then the symbol and name of the mutant gene appear in italicized lower case letters. 

3.2 ABBREVIATIONS

Abbreviations used in the present specification include the following: aa, amino
acid(s); bp, base pair(s); cDNA, DNA complementary to RNA; mRNA, messenger RNA;
tRNA, transfer RNA; nt, nucleotide(s); SSC, sodium chloride-sodium citrate; MAR, matrix
attachment region; DMSO, dimethyl sulfoxide; TPLSM, two photon laser scanning
microscopy; REMI, restriction enzyme mediated integration; WEFs, whole embryo
fibroblasts.

4. BRIEF DESCRIPTION OF THE FIGURES
FIGS. 1A-E illustrate the nucleotide sequence (SEQ ID NO: 6) comprising the
chicken lysozyme gene expression control region (SEQ ID NO: 7), the nucleotide sequence
encoding the chicken expression optimized human interferon α2b (IFNMAGMAX; SEQ ID
NO: 5) and an SV40 polyadenylation signal sequence (SEQ ID NO: 8).

FIG. 2 illustrates the nucleotide sequence SEQ ID NO: 5 encoding the chicken
expression optimized human interferon α2b (IFNMAGMAX).

FIGS. 3A-E illustrate the nucleotide sequence SEQ ID NO: 7 encoding the chicken 
lysozyme gene expression control region. 

FIG. 4 illustrates the nucleotide sequence SEQ ID NO: 8 encoding the SV40
polyadenylation signal sequence.

FIGS. 5A-C illustrate the nucleotide sequence SEQ ID NO: 9 encoding the chicken 
lysozyme 3' domain. 

FIGS. 6A-J illustrate the nucleotide sequence SEQ ID NO: 10 encoding the 
lysozyme gene expression control region (SEQ ID NO: 7) linked to the insert having the 
nucleotide sequence of SEQ ID NO: 5 encoding the chicken expression-optimized human 
interferon α2b (IFNMAGMAX) and the chicken lysozyme 3' domain SEQ ID NO: 9.

FIG. 7 illustrates the nucleotide sequence SEQ ID NO: 11 of the combinatorial
promoter MDOT. 

FIGS. 8A-B illustrate the oligonucleotides and primers (SEQ ID NOS: 14-31) used
in the formation of the chicken codon optimized human interferon α2b-encoding nucleic
acid.

FIG. 9 illustrates the primers (SEQ ID NOS: 32-35) used in the synthesis of the 
MDOT promoter. 

FIG. 10 illustrates the level of human monoclonal IgG antibody expressed in the
serum of a transgenic chick, as measured by ELISA.

FIG. 11 illustrates the detection of EGFP positive bands from transgenic sperm.

5. DETAILED DESCRIPTION OF THE INVENTION
The present invention relates to methods of introducing nucleic acids into avian
oocytes by sperm-mediated transfection to produce a transgenic chicken or quail, or other
avian species, carrying the transgene in the genetic material in all or most of its tissue,
including germ-line tissue. The methods and vectors of the present invention further
generate transgenic avians that express heterologous genes whose products are present in
the serum of the avian and/or are deposited into an avian egg, preferably in the egg white.
Vectors containing promoters that direct a high level of expression of the heterologous
protein in the avian, particularly in the magnum for deposition into the avian egg, are
provided. Additional regulatory elements, such as MARs, IRESs, enhancers,
polyadenylation signals, etc., may be included in the vectors of the invention to improve
expression and efficiency.

5.1 METHODS OF TRANSGENESIS
5.1.1 SPERM-MEDIATED INTEGRATION OF HETEROLOGOUS TRANSGENES

The transgenic avians of the present invention are most preferably generated using
sperm-mediated transfection of nucleic acid into avian oocytes. Specifically, the present
invention provides methods for introducing nucleic acids containing a transgene, preferably
a nucleic acid vector of the invention as described in Section 5.2, infra, into an avian oocyte
by sperm-mediated transfection. In preferred embodiments, the nucleic acid is first
introduced into an avian sperm in vitro by lipofection, electroporation, restriction enzyme
mediated integration (REMI) or similar methods, or in vivo by microinjection into the testis.
The modified avian sperm are then delivered to an avian oocyte either by natural coitus
(after the modified sperm have been returned to the testis of a male avian, or where the
nucleic acid has been injected directly into the testis), or in vitro by microinjection,
intracytoplasmic sperm injection (ICSI) or artificial insemination of oocytes isolated from
an ovulating female bird, thereby generating a transgenic zygote and chick. In certain
embodiments, the male germ cells are irradiated, more preferably irradiated by gamma rays,
before the heterologous nucleic acid is incorporated therein. In other embodiments, the
testis is depopulated of sperm prior to introduction of the transfected sperm.

The present invention contemplates that any technique capable of transferring
heterologous material into sperm could be used so long as the technique preserves enough
of the sperm's fertilization functions, such that the resultant sperm will be able to fertilize
the oocyte. It is understood that the heterologous nucleic acid may be integrated into the
genome of a recipient cell such as a spermatogonial cell or a spermatogonial precursor cell
for subsequent transfer to an embryo or the testicular material of the recipient male animal,
preferably a chicken. It is further understood that the heterologous nucleic acid may not be
integrated into the genome of the recipient cell but delivered as an episome which may or
may not be integrated into the genome of the resulting zygote or chick.

5.1.1.1 PREPARATION OF TRANSGENIC CONSTRUCT 

One aspect of the present invention relates to the preparation of a transgene which is
to be incorporated into the genome of an avian sperm. In certain embodiments, the
transgene comprises at least one heterologous nucleic acid. It is contemplated to be within
the scope of the present invention for the heterologous nucleic acid to comprise an
expression vector such as, but not limited to, viral vectors, plasmid vectors, or linearized
nucleic acid vectors or a combination thereof. (See Section 5.2, infra, for details on vectors,
and the preparation thereof). The expression vector may particularly be any suitable
nonviral vector including plasmid DNA, bacterial artificial chromosomes (BACs), yeast
artificial chromosomes (YACs), etc. The expression vector may also be any suitable viral
vector, for example, retroviral vectors, adenoviral vectors, transferrin-polylysine enhanced
adenoviral vectors, human immunodeficiency virus vectors, lentiviral vectors, Moloney
murine leukemia virus-derived vectors, and virus-derived DNAs that facilitate
polynucleotide uptake by and release into the cytoplasm of germ cells.

The transcriptional promoter of an expression vector of the present invention may be a
constitutively active promoter such as the cytomegaloviral promoter or Rous sarcoma virus
promoter, or a tissue-specific promoter, preferably a tissue-specific promoter operable in
oviduct cells of an avian species including, but not limited to, the promoters of the genes
encoding ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin, and ovomucin.
Optionally, the transcriptional promoter of an expression vector may be a regulatable
promoter. The expression vector may further comprise a region encoding a transcriptional
terminator, such as a bovine growth hormone transcriptional terminator.

In preferred embodiments, a transgene construct comprises at least two separate or
independent elements. A first element could comprise an oviduct-specific promoter, such
as, but not limited to, the ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin, or
ovomucin promoter, which would drive expression of a gene coding for a protein of interest in the
oviduct. A second element can be located either upstream or downstream of the first
element and comprises a protamine promoter, or a segment thereof that is sufficient to drive
the expression of a marker gene such as the Green Fluorescent Protein (GFP) to facilitate
identification of transfected sperm.

In one embodiment of the present invention, the heterologous nucleic acid comprises
cohesive ends characterized as capable of hybridizing to cohesive ends generated by a
restriction endonuclease. The cohesive ends on the nucleic acid may be generated by
restriction endonuclease cleavage of a circular or linear nucleic acid, by the chemical
addition of nucleotides to the ends of a linear nucleic acid, or by a combination of chemical
and enzymatic methods.

In another embodiment of the present invention, the heterologous nucleic acid is
linearized and has at least one blunt end. The blunt end of the nucleic acid may be
generated by nuclease digestion of cohesive ends, for example with S1 nuclease.

In the methods of generating transgenic cells according to the present invention, the
genomic nucleic acid of the recipient cell, male germ cell or oocyte can be cleaved to
receive the integrating heterologous nucleic acid. Any method may be selected that will
generate limited, random cleavage that will allow integration of the heterologous nucleic
acid into the genome of the recipient cell or oocyte. When the integrating heterologous
nucleic acid has cohesive ends, the recipient genomic nucleic acid may be cleaved with a
restriction endonuclease generating cohesive ends capable of hybridizing to the cohesive
ends of the heterologous nucleic acid. When the heterologous nucleic acid has blunt ends,
the genomic nucleic acid can be cleaved by any method that will generate blunt ends at the
cleavage site, including restriction endonuclease cleavage, or irradiation of the cell with
high-energy irradiation. Suitable radiations that may be applied to the methods of the
present invention include, for example, gamma rays, x-rays, ultraviolet light or ultrasound.
It is contemplated that the cleavage of genomic nucleic acid and integration of a
heterologous nucleic acid therein will either result in a viable recipient cell that can be used to
fertilize an avian oocyte, or will not yield a viable cell. A non-viable sperm cell may,
however, be used to deliver the transgene to an oocyte using, for example, the ICSI
(CHICSI™) method.
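
The requirement that the cohesive ends of the heterologous nucleic acid be capable of hybridizing to the cohesive ends of the cleaved genomic nucleic acid can be illustrated with a minimal Python sketch. It is an illustration only and not part of the described methods. It assumes that both single-stranded overhangs are of the same polarity (for example, both 5' overhangs) and are written 5' to 3'; the function names are hypothetical.

```python
# Illustrative sketch: two overhangs of the same polarity, each written 5'->3',
# can anneal when one is the reverse complement of the other.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def ends_can_anneal(overhang_a: str, overhang_b: str) -> bool:
    """True if the two single-stranded overhangs are fully complementary."""
    a, b = overhang_a.upper(), overhang_b.upper()
    return len(a) == len(b) and a == reverse_complement(b)


# Both ends cut with the same enzyme leaving an AATT overhang: compatible.
print(ends_can_anneal("AATT", "AATT"))
# An AATT overhang against a GATC overhang: not compatible.
print(ends_can_anneal("AATT", "GATC"))
```

Overhangs left by a single enzyme with a palindromic recognition site are self-complementary, which is why cleaving both the heterologous nucleic acid and the genomic nucleic acid with the same enzyme yields compatible ends.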

The heterologous nucleic acid of the present invention may further comprise a
centromere element and at least one telomere element. In one embodiment, the centromere
and the at least one telomere are derived from the chicken. While the ori site alone will
allow replication of the heterologous nucleic acid when transfected into an oocyte or zygote
thereof, segregation of the replicates into each daughter cell will require the optional
centromeric element. In the absence of this centromeric element, segregation will be
random between daughter cells, with some daughter cells not receiving a copy of the
transgenic nucleic acid. A mosaic transgenic animal would, therefore, result.
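
The effect of random segregation can be illustrated with a toy calculation. The sketch below is not taken from the specification. It assumes, purely for illustration, that each copy of an episome lacking a centromere is partitioned to one of the two daughter cells independently and with equal probability; the function name is hypothetical.

```python
def prob_daughter_lacks_episome(copies_at_division: int) -> float:
    """Probability that one given daughter cell inherits no copy.

    Assumption (illustrative model only): each of the `copies_at_division`
    copies goes to either daughter independently with probability 1/2.
    """
    if copies_at_division < 0:
        raise ValueError("copy number cannot be negative")
    return 0.5 ** copies_at_division


# With two copies present at division, a given daughter is left with no
# copy one time in four under this model.
print(prob_daughter_lacks_episome(2))
```

Under this simplified model the chance of producing a daughter cell without the transgene is never zero, which is consistent with the mosaicism described above; a centromeric element removes the reliance on chance.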

In one embodiment of the present invention, therefore, the heterologous nucleic acid
is an artificial chromosome comprising a heterologous transgenic element having the
properties desired to be expressed by a transgenic animal, an origin of replication (ori) site,
and a centromere. In this embodiment, the heterologous nucleic acid may be a circular
nucleic acid or a linear nucleic acid. In another embodiment, the heterologous nucleic acid
is a linear nucleic acid further comprising telomeres.

In another aspect of the methods according to the present invention, the transgenic
oocyte or ovum of the present invention is incubated for development of the zygote therein
to a fetus, and subsequently to a chick for hatching. In one embodiment of the present
invention, therefore, the zygote is incubated in a surrogate avian female, wherein the
method comprises the steps of fistulating an avian female, delivering the avian oocyte to the
infundibulum of the female bird, allowing the avian female to incubate the avian oocyte to
an embryo within an egg, allowing the avian female to lay the egg, and allowing the embryo
to hatch as a viable chick, wherein the chick is a transgenic chick having an exogenous
nucleic acid incorporated therein.



5.1.1.2 SPERM TRANSGENESIS 

The heterologous nucleic acid may be delivered to an avian male germ cell (i.e.,
sperm, spermatozoon cell or a precursor cell) by a method comprising contacting the male
germ cell with a gene delivery mixture comprising a nucleic acid, either a eukaryotic viral
vector or a vector that is not derived from a eukaryotic virus, at about or below the avian's
body temperature and for an effective period of time such that the nucleic acid is
incorporated into the cell, and preferably into the genome of the cell; optionally isolating or
selecting the genetically modified cell with the aid of a genetic selection marker expressed
in the genetically modified cell; and transferring the isolated or selected genetically modified
germ cell to a testis of a recipient male avian such that the cell lodges in a seminiferous
tubule of the testis. A genetically modified male gamete may be produced therein, and
breeding the recipient male avian with a female avian of its species will generate transgenic
progeny that carry the heterologous transgenic nucleic acid in their genomes.

In certain embodiments, the avian male germ cells are isolated and removed from a
male avian. The avian male germ cells are then transfected by introducing the heterologous
nucleic acid into the genome of the avian male germ cells by lipofection, electroporation,
restriction enzyme mediated integration (REMI) or similar methods. In certain other
embodiments, the heterologous nucleic acid is injected directly into the testis of the male
avian for transfection. Male germ cells can be extracted to determine whether transfection
has occurred or the extent of transfection. The male avian can be mated with a female avian
to produce transgenic offspring, or the sperm can be used for in vitro fertilization (IVF).

The precursor cell may be selected from the group consisting of spermatogonial
stem cells, type B spermatogonia, primary spermatocytes, preleptotene spermatocytes,
leptotene spermatocytes, zygotene spermatocytes, pachytene spermatocytes, secondary
spermatocytes, and spermatids. The embodiment further comprises the steps of
incorporating the heterologous transgene into the genome of the spermatozoon cell or the
precursor cell, so that a genetically modified male gamete is produced by the male avian,
and breeding the male avian with a female of the same species such that a transgenic
progeny is thereby produced that carries the polynucleotide in its genome.

In certain embodiments, the heterologous genetic material may be introduced into
the genome of an avian male germ cell, such that a polynucleotide is delivered using known
gene delivery systems to male germ cells in situ in the testis of the male avian (e.g., by in
vivo transfection or transduction). In one embodiment, the invention relates to an in vitro
method of incorporating heterologous genetic material into the genome of a male avian by
isolating male germ cells ex corpora, delivering a polynucleotide thereto, and then returning
the transfected cells to the testes of a recipient male bird. In yet another embodiment, the in
vitro method involves microinjecting the recombinant male germ cells into a recipient
fertilized oocyte, whereupon the sperm head enters the oocyte nucleus to deliver the
heterologous nucleic acid thereto.

In a preferred embodiment, the invention relates to an in vivo method that injects a
gene delivery mixture, preferably into the seminiferous tubules, or into the testis, and most
preferably into the vas efferens or vasa efferentia using, for example, a micropipette and a
picopump delivering a precise measured volume under controlled amounts of pressure. The
modified germ cells differentiate in their own milieu. Progeny animals exhibiting the
nucleic acid's integration into their germ cells (i.e., transgenic animals) are selected. The
selected progeny can then be mated, or their sperm utilized for insemination or in vitro
fertilization, to produce further generations of transgenic progeny or for microinjection into
isolated oocytes.

In another preferred embodiment, the invention relates to an in vitro method wherein
male germ cells are obtained or collected from a donor male avian, by any means known in
the art such as, for example, transection of the testes. The male germ cells are then exposed
to a gene delivery mixture, preferably within several hours of collection, or cryopreserved
for later use. When the male germ cells are obtained from the donor avian by transection of
the testes, the cells can be incubated in an enzyme mixture known for gently breaking up the
tissue matrix and releasing undamaged cells. Suitable components of such a mixture
include, but are not limited to, pancreatic trypsin, collagenase type I, and pancreatic
DNAse type I, together with bovine serum albumin and a modified DMEM medium. After
washing, the cells can be placed in an incubation medium such as DMEM or the like,
and plated on a culture dish for genetic modification by exposure to a gene delivery mixture.

In other embodiments, a transgene can be incorporated into an avian sperm by
lipofection, electroporation, restriction enzyme mediated integration (REMI),
intracytoplasmic sperm injection (ICSI) or similar methods.

Liposome

In a preferred embodiment, a transgene is incorporated into an avian sperm by
liposomes. The male germ cells, which may be intact and viable spermatozoa, or the
non-viable heads thereof, may be transferred to a recipient oocyte using liposome-mediated
delivery. PCT Publication WO 87/05325, which is incorporated by reference herein in its
entirety, discloses a method of transferring organic and/or inorganic material into sperm or
egg cells by using liposomes. The heterologous nucleic acid can also be incorporated into a
sperm using Lipofectin-based liposomes. (See, e.g., Bachiller et al., 1991, Mol.
Reprod. Develop. 30: 194-200; Nakanishi and Iritani, 1993, Mol. Reprod. Develop. 36:
258-261).

Electroporation 

In another preferred embodiment, a transgene is incorporated into an avian sperm by
electroporation. The application of electrical current has been shown to enhance the uptake
of exogenous DNA fragments by cultured cells. Enhancement of nuclear uptake of the
heterologous DNA will promote earlier chromosomal integration of the exogenous DNA
molecules, thus reducing the degree of genetic mosaicism observed in transgenic avian
founders.

In one embodiment, the male germ cells are placed in a cuvette and a solution of the
transgenic nucleic acid coding for the protein of interest is added. A direct current pulse is
discharged in the cuvette suspension. The current pulse creates temporary, short-lived pores
in the cell membrane and allows the male germ cells to take up the transgene while only
slightly compromising cell viability. Further description of the use of electroporation to
incorporate DNA can be found in Gagne et al., 1991, Mol. Reprod. Develop. 29: 6-15,
which is incorporated herein by reference in its entirety.

Restriction Enzyme Mediated Integration (REMI)

In yet another preferred embodiment, a transgene is incorporated into an avian sperm
by restriction enzyme mediated integration (REMI). The heterologous nucleic acid to be
integrated into, for example, the sperm nuclear DNA is converted to a linear double
stranded DNA possessing single-stranded cohesive ends by contacting the heterologous
DNA with a type II restriction enzyme that, upon scission, generates such ends. The nucleic
acid to be cut can be a circular nucleic acid such as in a plasmid or a viral vector or a linear
nucleic acid that possesses at least one recognition and cutting site outside of the genes or
regulatory regions critical to the desired post-integration function of the nucleic acid, and no
recognition and cutting sites within the critical regions.
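
The site-placement criterion above (at least one recognition and cutting site outside the critical regions, and none within them) lends itself to a simple computational check. The following Python sketch is illustrative only. It assumes a palindromic recognition sequence that is cut within the recognition site, so that scanning one strand suffices; the construct sequence, coordinates and function names are hypothetical and not taken from the specification.

```python
from typing import List, Tuple


def find_sites(sequence: str, site: str) -> List[int]:
    """Return 0-based start positions of every occurrence of `site`."""
    seq, site = sequence.upper(), site.upper()
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start)
        start = seq.find(site, start + 1)  # allow overlapping matches
    return positions


def enzyme_is_suitable(sequence: str, site: str,
                       critical_regions: List[Tuple[int, int]]) -> bool:
    """True if `site` occurs at least once, and never overlaps a critical
    region. Regions are half-open (start, end) intervals, 0-based."""
    hits = find_sites(sequence, site)
    if not hits:
        return False  # the enzyme would not linearize the construct
    for pos in hits:
        end = pos + len(site)
        for region_start, region_end in critical_regions:
            if pos < region_end and end > region_start:
                return False  # the enzyme would cut a gene or regulatory region
    return True


# Hypothetical 20-nt construct whose last 10 nt are the critical region.
construct = "TTGAATTCTT" + "ACGTACGTAC"
critical = [(10, 20)]
print(enzyme_is_suitable(construct, "GAATTC", critical))  # site lies outside
print(enzyme_is_suitable(construct, "GTAC", critical))    # site lies inside
```

For non-palindromic recognition sequences, or for enzymes that cut at a distance from the recognition site, the reverse strand and the actual cut position would also have to be examined.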

Alternatively, the heterologous DNA to be integrated into the sperm nuclear DNA
can be prepared by chemically and/or enzymatically adding cohesive ends to a linear DNA.
The added cohesive ends must be able to hybridize to the cohesive ends characteristic of a
nucleic acid cleaved by a type II restriction endonuclease. Alternatively, the cohesive ends
can be added by combining the methods based on type II restriction enzyme cutting and
chemical and/or enzymatic addition. It is also within the scope of the present invention for
the linearized nucleic acid to have one end that is a blunt end without unpaired nucleotides.
Such blunt ends can be generated by restriction endonuclease digestion, exonuclease
digestion of cohesive ends or fill-in of cohesive ends by polynucleotide synthesis, using
methods as described, for example, in Sambrook et al. (supra), incorporated herein by
reference in its entirety.

It is also to be understood that a nucleic acid to be delivered to a recipient cell may
be cleaved with two different restriction endonucleases that may generate the same or
different cohesive termini, or at least one blunt-end terminus. Neither restriction
endonuclease will have a recognition site within the nucleic acid sequence required to be a
transgene in the recipient cell.

When a restriction endonuclease is used to cleave the genomic nucleic acid of the 
recipient cell, the endonuclease may be co-delivered to the recipient cell such as a sperm 
cell with the heterologous nucleic acid, or sequentially delivered. If a nucleic acid is 
cleaved with at least two restriction endonucleases, thereby generating at least one cohesive
terminus, the at least two endonucleases may be delivered to a recipient cell either together 
or sequentially. The transfected nucleic acid may be mixed with at least one of the 
endonucleases or delivered to a recipient cell before or after at least one endonuclease is 
delivered thereto. 



At least one terminus of a linearized nucleic acid to be delivered to a recipient cell
may be a blunt end terminus, generated by endonuclease cleavage, chemical synthesis,
enzyme directed nucleic acid digestion or synthesis, or any combination thereof. A
recipient cell genome, such as a sperm cell genome, may therefore be cleaved before, during
or after delivery of the linearized nucleic acid to the cell, by delivery of a blunt-end
generating restriction endonuclease to the recipient cell, or by radiation-induced cleavage.
Suitable radiations that may be applied to, for example, a sperm cell include, but are not
limited to, gamma radiation, x-rays, ultraviolet light and ultrasound. The dose and duration
of the radiation applied to a cell sample are determined for each sample, for levels of
cleavage that will allow integration of the transfected nucleic acid into the cell genome,
while maintaining viability of the cells for use in artificial insemination or recolonization of
an avian testis. Viability of a recipient sperm may not be required when the transfected
sperm are delivered to a recipient avian oocyte by such procedures as ICSI or CHICSI™.
Cleavage of the genomic nucleic acid by irradiation or ultrasound can take place before,
during or after delivery of the heterologous nucleic acid to the recipient cell.

While not wishing to be bound by any one theory, the transfected nucleic acid may
be integrated into a cleavage site of the genomic nucleic acid. Integration may be facilitated
by the cohesive ends on the heterologous nucleic acid that hybridize to the like cohesive
ends of the cleaved genomic nucleic acid. The integrated heterologous nucleic acid will
then replicate and segregate with the genome of the recipient cell.

Alternatively, the heterologous nucleic acid may not be integrated into a recipient
genome, but will remain as an extrachromosomal episome. The heterologous nucleic acid
of the present invention may circularize by hybridization of the cohesive ends of the nucleic
acid, rather than be integrated into the genome. When the heterologous nucleic acid
comprises any natural or synthetic origin of replication (ori element) the nucleic acid will be
capable of replicating independently of the recipient genome. In one embodiment of the
present invention the ori site included with a heterologous nucleic acid is derived from the
SV40 virus. Episomal replication and segregation of daughter copies of the episome is
facilitated by the linearized viral ori site and/or a centromere isolated from, for example, a
chicken chromosome, thereby generating a chicken artificial chromosome. In another
embodiment, the linearized heterologous nucleic acid will not be integrated into the genome
of the recipient cell but remain as a separate unit that, because of a centromeric structure
incorporated therein, will segregate into daughter cells during mitotic division. In this case,
the unincorporated episomal heterologous nucleic acid is a chicken artificial chromosome
(CAC).

The REMI method for stably integrating heterologous DNA into the genomic DNA
of a recipient cell is described by Shemesh et al. in PCT Publication No. WO 99/42569,
which is incorporated herein by reference in its entirety. This REMI method comprises in part an
adaptation of the REMI technique disclosed by Schiestl and Petes (Proc. Nat. Acad. Sci.
U.S.A. 88, 7585-7589 (1991)) and Kuspa and Loomis (Proc. Nat. Acad. Sci. U.S.A. 89,
8803-8807 (1992)), both incorporated herein by reference in their entireties.

In preferred embodiments, the avian sperm are irradiated before being exposed to
the gene delivery mixture or having a transgene incorporated therein. The male germ cells can
be irradiated with a suitable dose of gamma irradiation, preferably 1 Gy, 2 Gy, 3 Gy, 4 Gy,
5 Gy, 6 Gy, 7 Gy, 8 Gy, 9 Gy, 10 Gy, 11 Gy, 12 Gy, 15 Gy or 20 Gy, without compromising
the viability and/or motility of the sperm. (See Wooster et al., 1977, Can. J. Genet. Cytol.
19: 437-446).

Whether employed in the in vivo, in situ or in vitro method, the gene delivery
mixture, once in contact with the male germ cells, facilitates the uptake and transport of
heterologous genetic material into the appropriate cell location for integration into the
genome and expression. A number of known gene delivery methods can be used to promote the
uptake of nucleic acid sequences into the cell and to facilitate the integration of the
heterologous nucleic acid into the genome of the recipient cell. Such methods include, but
are not limited to, viral vectors, liposomes, electroporation, REMI, and ICSI.

A gene delivery mixture suitable for use in the in vivo, in situ or in vitro methods of
sperm-mediated transfection comprises a nucleic acid encoding a desired trait or product,
and a suitable promoter sequence such as, for example, a tissue-specific promoter, or an
IRES. The transgenic nucleic acids of the present invention may further comprise an origin
of replication. For example, an origin of replication may be the SV40 ori, or a centromere
derived from the chicken. A linear nucleic acid may further comprise a telomere at one or
both ends of the nucleic acid.

Optionally, agents that increase the uptake of the nucleic acid sequence, or vectors
that comprise it, may be included in the gene delivery mixture. Examples include liposomes,
retroviral vectors, adenoviral vectors, adenovirus enhanced gene delivery systems, vectors
not derived from eukaryotic viruses (e.g., plasmids, BACs, YACs, etc.), or
combinations thereof. A reporter construct,
including a genetic selection marker, such as the gene encoding Green Fluorescent
Protein, may also be added to the gene delivery mixture. Targeting molecules, such as c-kit
ligand, can be added to the gene delivery mixture to enhance the transfer of genetic material
into the male germ cell. An immunosuppressing agent, such as cyclosporin or a
corticosteroid, may also be added to the gene delivery mixture as known in the art.

Any of a number of commercially available gene delivery mixtures can be used, to
which the polynucleotide encoding a desired trait or product is further admixed. The final
gene delivery mixture comprising the polynucleotide can then be admixed with the male
gamete cells and allowed to interact for a period of about 2 hours to about 16
hours, at a temperature of about 33°C to about 37°C. After this period, the cells are
preferably placed at a lower temperature of about 33°C to about 34°C, for about 4 hours to
about 20 hours, preferably about 16 to about 18 hours.

Transgenic germ cells (as well as transgenic somatic cells and transgenic
vertebrates) may be isolated and/or selected by any suitable means, such as, but not limited to,
detection of physiological and/or morphological phenotypes of interest by
biochemical, enzymatic, immunochemical, histologic, electrophysiologic, biometric or like
methods, and analysis of cellular nucleic acids, for example for the presence or absence of
specific DNAs or RNAs of interest, using conventional molecular biological techniques,
including hybridization analysis, nucleic acid amplification including, but not limited to,
polymerase chain reaction, transcription-mediated amplification, reverse
transcriptase-mediated ligase chain reaction, and/or electrophoretic technologies.

One preferred method of isolating or selecting male germ cell populations comprises 
obtaining specific male germ cell populations, such as spermatogonia, from a mixed 
population of testicular cells by extrusion of the cells from the seminiferous tubules and 
enzyme digestion. The spermatogonia, or other male germ cell populations, can be isolated
from a mixed cell population by methods such as the utilization of a promoter sequence that 
is specifically or selectively active in cycling male germ line stem cell populations. Suitable 
promoters include B-Myb or a specific promoter, such as the c-kit promoter region, c-raf-1 
promoter, ATM (ataxia-telangiectasia) promoter, vasa promoter, RBM (ribosome binding 
motif) promoter, DAZ (deleted in azoospermia) promoter, XRCC-1 promoter, HSP 90 (heat
shock gene) promoter, cyclin A1 promoter, or FRM (from Fragile X site) promoter and the
like. A selected promoter may be linked to a reporter construct, for example, a construct 
comprising a gene encoding Green Fluorescent Protein (or EGFP), Yellow Fluorescent 
Protein, Blue Fluorescent Protein, a phycobiliprotein, such as phycoerythrin or phycocyanin, 
or any other protein which fluoresces under suitable wavelengths of light, or encoding a
light-emitting protein, such as luciferase or apoaequorin. The unique promoter sequences
drive the expression of the reporter construct only during specific stages of male germ cell
development (e.g., Müller et al., 1999, J. Biol. Chem. 276(16), 11220-28; Schrans-Stassen
et al., 1999, Endocrinology 140, 5894-5900, both of which are incorporated herein by
reference in their entireties). In the case of a fluorescent reporter construct, the cells can be
sorted, with the aid of, for example, a FACS set at the appropriate wavelength(s), or they can
be selected by chemical methods.

Male germ cells that have the DNA modified in the desired manner are isolated or 
selected, and transferred to the testis of a suitable recipient avian, preferably the donor male 
avian of the male germ cells. Further selection can be attempted after biopsy of one or both
of the recipient male's testes, or after examination of the animal's ejaculate amplified by the 
polymerase chain reaction to confirm that the desired nucleic acid sequence had been 
incorporated. 

The genetically modified germ cells isolated or selected as described above are 
transferred to a testis of a suitable male avian, preferably a chicken, that can be, but need not
be, the same donor animal. Before transferring the genetically modified male germ cells to 
the recipient animal, the testes of the recipient are depopulated of endogenous germ cells, 
thereby facilitating the colonization of the recipient testis by the genetically modified germ 
cells. Depopulation of the testis has commonly been accomplished by exposing the whole 
animal to gamma irradiation or by localized irradiation of the testis. The basic rigid
architecture of the gonad should not be destroyed, nor significantly damaged. Disruption of
tubules may lead to impaired transport of testicular sperm and result in infertility. Sertoli
cells should not be irreversibly damaged, as they provide a base for development of the
germ cells during maturation, and for preventing the host immune defense system from
destroying grafted foreign spermatogonia.

Suitable denuding methods include irradiation by gamma-rays, x-rays, ultrasound,
ultraviolet light, by chemical treatment, by means of infectious agents such as viruses, or by
autoimmune depletion or by combinations thereof, preferably by a combined treatment of
the vertebrate with an alkylating agent and gamma irradiation as taught in WO 00/69257,
incorporated herein by reference in its entirety.

Gamma radiation-induced spermatogonial degeneration is probably related to the
process of apoptosis (Hasegawa et al., 1998, Radiat. Res. 149:263-70). Alternatively, a
composition containing an alkylating agent such as busulfan (MYLERAN™) can be used to
depopulate the testis (Jiang F.X., 1998, Anat. Embryol. 198(1): 53-61; Russell and Brinster, 1996, J.
Androl. 17(6): 615-27; Boujrad et al., 1995, Andrologia 27(4): 223-28; Linder et al., 1992,
Reprod. Toxicol. 6(6): 491-505; Kasuga and Takahashi, 1986, Endocrinol. Jpn. 33(1): 105-
15). Other cytotoxic alkylating agents include, but are not limited to, chlorambucil,
cyclophosphamide, melphalan, and ethyl ethanesulfonic acid, and may be combined with
gamma irradiation, to be administered in either sequence.

The dose of the alkylating agent and the dose of gamma radiation are in an amount
sufficient to substantially depopulate the testis. The alkylating agent can be administered by 
any pharmaceutically acceptable delivery system, including but not limited to, 
intraperitoneal, intravenous, or intramuscular injection, intravenous drip, implant, 
transdermal or transmucosal delivery systems. 

The isolated or selected genetically modified germ cells are transferred into the 
recipient testis by direct injection using a suitable micropipette. Support cells, such as 
Leydig or Sertoli cells, that can be unmodified or genetically modified, can be transferred to 
a recipient testis along with the modified germ cells. 






5.1.1.3 DELIVERY OF TRANSGENIC SPERM TO OOCYTES 

The transfected male avian germ cells may be used to deliver a heterologous nucleic 
acid to an avian oocyte by implanting the transfected male germ cells, such as transfected
spermatogonial precursor cells, into the testicular tissue of host male birds previously
denuded of viable spermatogonial cells or sperm. The implanted transfected male avian
germ cells may colonize the testicular tissue, proliferate therein, and generate viable 
transgenic sperm that may be harvested for use in artificial insemination procedures, or 
transferred to a recipient oocyte by natural coitus. 

In certain embodiments, therefore, the transgenic avian may be produced by the 
sperm-mediated transfer of at least one heterologous transgene. The transgene may be
incorporated into the genomic nucleic acid of a spermatozoon cell or a precursor thereof, so
that a genetically modified avian sperm is produced by the male avian. Breeding the male
avian with a female of its species will generate a transgenic progeny carrying the at least one
transgene in its genome.

A union of male and female gametes to form a transgenic zygote is brought about by
copulation of the male and female vertebrates of the same species, or by in vitro or in vivo
artificial means. If artificial means are chosen, then incorporating into the genome a genetic
selection marker that is expressed in male germ cells is particularly useful.

Suitable artificial means include, but are not limited to, artificial insemination, in
vitro fertilization (IVF) and/or other artificial reproductive technologies, such as
intracytoplasmic sperm injection (ICSI), subzonal insemination (SUZI), or partial zona
dissection (PZD). Other techniques, such as cloning and embryo transfer, cloning and embryo
splitting, and the like, can also be employed.

In a preferred embodiment, a transgene is incorporated into an avian sperm by 
intracytoplasmic sperm injection (ICSI). The male germ cells, which may be intact and
viable spermatozoa, or the non-viable heads thereof, may be microinjected into the
cytoplasm or the nucleus of an isolated oocyte such as an avian oocyte, preferably a chicken
oocyte, by any method known to one of skill in the art, including, for example, combining a
confocal microscope and micromanipulator, or the like, to visualize and monitor the
microinjection of an opaque avian egg.

The transgenic vertebrate progeny can, in turn, be bred by natural mating, artificial 
insemination, or by in vitro fertilization (IVF) and/or other artificial reproductive
technologies, such as intracytoplasmic sperm injection (ICSI) and chicken intracytoplasmic
sperm injection (CHICSI™), subzonal insemination (SUZI), or partial zona dissection
(PZD), to obtain further generations of transgenic progeny. Although the genetic material is
originally inserted solely into the germ cells of a parent animal, it will ultimately be present 
in the germ cells of future progeny and subsequent generations thereof. In addition, the 
genetic material will also be present in cells of the progeny other than germ cells, i.e., 
somatic cells. 

The methods of the present invention may further comprise returning a transfected
fertilized oocyte to a surrogate mother, especially a female chicken, for the continued
incubation and development of the transgenic zygote. With chickens, the developed embryo
is laid as a hard-shell egg that will hatch as a viable chick. When the heterologous nucleic
acid is directly integrated into the genome of the oocyte, the transgenic chick will include
the transgenic heterologous nucleic acid in all of its cells. Where the heterologous nucleic
acid is episomal with respect to the genome of the transgenic zygote and chick, and the
episomal nucleic acid comprises a centromeric body, most, if not all, of the cells of the
zygote and chick will comprise the heterologous nucleic acid. When the episomal nucleic
acid does not include a centromeric body, however, the transgenic zygote and chick can be a
mosaic wherein expression of the exogenous transgene will only occur in some, but not all,
cells or tissues of the transgenic animal.

5.1.2 BREEDING AND MAINTENANCE OF TRANSGENIC AVIAN 
Another aspect of the present invention is a transgenic avian produced by the 
methods of the present invention and producing a heterologous polypeptide in an egg,
wherein the transgenic avian comprises at least one heterologous nucleic acid sequence
encoding the polypeptide and wherein the heterologous polypeptide is delivered to the white
of an avian egg by a female of the avian.

The invention relates to a method of producing transgenic avians that express 
significant quantities of useful heterologous proteins, e.g., therapeutic and diagnostic
proteins, including immunoglobulins, industrially useful proteins and other biologics, etc., in
the avian egg white. The heterologous protein can then be readily purified from the avian 
egg. The methods of the invention provide improved efficiencies of transgenesis, 
transmission of the transgene and/or level of heterologous protein expression. Another 
aspect of the invention is a method of producing a transgenic avian capable of expressing a
heterologous protein. Therefore, the present invention relates to methods of producing 
transgenic avians, preferably chickens, wherein the incorporated transgene may be 
expressed as a constituent protein of the white of a hard-shell egg. 

Although the genetic material is originally inserted solely into the germ cells of a 
parent animal, it will ultimately be present in the germ cells of future progeny and
subsequent generations thereof. In addition, the genetic material will also be present in cells
of the progeny other than germ cells, i.e., somatic cells. 

Using the methods of the invention for producing transgenic avians, particularly 
methods using vectors that are not derived from eukaryotic viruses, and, preferably, the 
methods of cytoplasmic micro-injection described herein, the level of mosaicism of the
transgene (percentage of cells containing the transgene) in avians hatched from
microinjected embryos (i.e., the G0s) is greater than 5%, 10%, 25%, 50%, 75% or 90%, or is
the equivalent of one copy per one genome, two genomes, five genomes, seven genomes or
eight genomes, as determined by any number of techniques known in the art and described
infra.

In additional particular embodiments, the percentage of G0s that transmit the
transgene to progeny (G1s) is greater than 5%, preferably, greater than 10%, 20%, 30%,
40%, and, most preferably, greater than 50%, 60%, 70%, 80%, 90%. In other embodiments,
the transgene is detected in 10%, 20%, 30%, 40%, and most preferably, greater than 50%,
60%, 70%, 80%, 90% of chicks hatching from embryos into which nucleic acids have been
introduced using methods of the invention.

5.2 VECTORS 

A variety of vectors useful in carrying out the methods of the present invention are 
described herein. These vectors may be used for stable introduction of a selected
heterologous polypeptide-coding sequence (and/or regulatory sequences) into the genome of
an avian, in particular, to generate transgenic avians that produce exogenous proteins in
specific tissues of an avian, and in the oviduct in particular, or in the serum of an avian. In
still further embodiments, the vectors are used in methods to produce avian eggs containing
exogenous protein.

In particular embodiments, preferably for use in the sperm-mediated transgenesis
methods described herein, the vectors of the invention are not derived from eukaryotic viral 
vectors or retroviral vectors (except in certain embodiments for containing eukaryotic viral 
regulatory elements such as promoters, origins of replication, etc). In particular 
embodiments, the vector is not an REV, ALV or MuLV vector. In particular, useful vectors
include bacteriophages such as lambda derivatives, such as λgt11, λgt WES.tB, Charon 4,
and plasmid vectors such as pBR322, pBR325, pACYC177, pACYC184, pUC8, pUC9,
pUC18, pUC19, pLG339, pR290, pKC37, pKC101, SV40, pBLUESCRIPT® II SK +/- or KS
+/- (see "Stratagene Cloning Systems" Catalog (1993) from STRATAGENE®, La Jolla,
Calif., which is hereby incorporated by reference), pQE, pIH821, pGEX, pET series (see
Studier, F.W. et al., 1990, "Use of T7 RNA Polymerase to Direct Expression of Cloned
Genes" Gene Expression Technology 185, which is hereby incorporated by reference) and
any derivatives thereof, cosmid vectors and, in preferred embodiments, artificial
chromosomes, such as, but not limited to, YACs, BACs, BBPACs or PACs. Such artificial
chromosomes are useful in that a large nucleic acid insert can be propagated and introduced
into the avian cell.

In other particular embodiments, as detailed above in section 5.2, infra, the vectors 
of the invention are derived from eukaryotic viruses, preferably avian viruses, and can be 
replication competent or, preferably, replication deficient. In particular embodiments, the
vectors are derived from REV, ALV or MuLV. Nucleic acid sequences or derivative or
truncated variants thereof, may be introduced into viruses such as vaccinia virus. Methods
for making a viral recombinant vector useful for expressing a protein under the control of
the lysozyme promoter are analogous to the methods disclosed in U.S. Patent Nos.
4,603,112; 4,769,330; 5,174,993; 5,505,941; 5,338,683; 5,494,807; 4,722,848; Paoletti,
1996, Proc. Natl. Acad. Sci. 93: 11349-11353; Moss, 1996, Proc. Natl. Acad. Sci. 93:
11341-11348; Roizman, 1996, Proc. Natl. Acad. Sci. 93: 11307-11302; Frolov et al., 1996,
Proc. Natl. Acad. Sci. 93: 11371-11377; Grunhaus et al., 1993, Seminars in Virology 3:
237-252 and U.S. Patent Nos. 5,591,639; 5,589,466; and 5,580,859 relating to DNA
expression vectors, inter alia; the contents of which are incorporated herein by reference in
their entireties.

Recombinant viruses can also be generated by transfection of plasmids into cells 
infected with virus. 

Preferably, vectors can replicate (i.e., have a bacterial origin of replication) and be 
manipulated in bacteria (or yeast) and can then be introduced into avian cells. Preferably, 
the vector comprises a marker that is selectable and/or detectable in bacteria or yeast cells
and, preferably, also in avian cells; such markers include, but are not limited to, Amp^r, tet^r,
LacZ, etc. Preferably, such vectors can accommodate (i.e., can be used to introduce into
cells and replicate) large pieces of DNA such as genomic sequences, for example, large
pieces of DNA consisting of at least 25 kb, 50 kb, 75 kb, 100 kb, 150 kb, 200 kb or 250 kb,
such as BACs, YACs, cosmids, etc.

The insertion of a DNA fragment into a vector can, for example, be accomplished by
ligating the DNA fragment into a vector that has complementary cohesive termini. 
However, if the complementary restriction sites used to fragment the DNA are not present 
in the vector, the ends of the DNA molecules may be enzymatically modified.
Alternatively, any site desired may be produced by ligating nucleotide sequences (linkers)
onto the DNA termini; these ligated linkers may comprise specific chemically synthesized 
oligonucleotides encoding restriction endonuclease recognition sequences. In an alternative 
method, the cleaved vector and the transgene may be modified by homopolymeric tailing.

The vector can be cloned using methods known in the art, e.g., by the methods
disclosed in Sambrook et al. (supra); Ausubel et al., 1989, Current Protocols in Molecular
Biology, Green Publishing Associates and Wiley Interscience, N.Y., both of which are
hereby incorporated by reference in their entireties. Preferably, the vectors contain cloning
sites, for example, restriction enzyme sites that are unique in the sequence of the vector and
at which insertion of a sequence would not disrupt an essential vector function, such as
replication.
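
As a rough illustration of the unique-cloning-site criterion described above, the following Python sketch counts recognition sequences in a circular vector and reports the enzymes that cut exactly once. The toy vector sequence and the function names are hypothetical and chosen only for illustration; the EcoRI (GAATTC), BamHI (GGATCC) and HindIII (AAGCTT) recognition sequences are the standard ones. The sketch does not test whether a site lies within an essential region such as the origin of replication, which would still have to be checked separately.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    complement = {"A": "T", "T": "A", "G": "C", "C": "G"}
    return "".join(complement[base] for base in reversed(seq.upper()))


def count_sites(vector, site, circular=True):
    """Count occurrences of a recognition sequence on either strand."""
    vector, site = vector.upper(), site.upper()
    # For a circular vector, append the first len(site) - 1 bases so that
    # sites spanning the start/end of the linear representation are found.
    text = vector + vector[: len(site) - 1] if circular else vector
    # A palindromic site equals its own reverse complement; using a set
    # avoids counting such a site twice.
    patterns = {site, reverse_complement(site)}
    count = 0
    for pattern in patterns:
        start = text.find(pattern)
        while start != -1:
            count += 1
            start = text.find(pattern, start + 1)  # allow overlapping hits
    return count


def unique_sites(vector, enzymes, circular=True):
    """Return the names of enzymes whose site occurs exactly once."""
    return sorted(
        name for name, site in enzymes.items()
        if count_sites(vector, site, circular) == 1
    )


# Standard recognition sequences; the vector below is a made-up toy sequence.
ENZYMES = {"EcoRI": "GAATTC", "BamHI": "GGATCC", "HindIII": "AAGCTT"}
TOY_VECTOR = "AAGAATTCTTGGATCCAAGGATCCTT"

if __name__ == "__main__":
    print(unique_sites(TOY_VECTOR, ENZYMES))
```

In this toy case only the single-copy site qualifies as a candidate cloning site; the site that occurs twice and the site that is absent are both excluded.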

As discussed above, vectors used in certain methods of the invention preferably can 
accommodate, and in certain embodiments comprise, large pieces of heterologous DNA 
such as genomic sequences, particularly avian genomic sequences. Such vectors can 
contain an entire genomic locus, or at least sufficient sequence to confer endogenous 
regulatory expression pattern, e.g., high level of expression in the magnum characteristic of
ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin, and ovomucin, etc, and to 
insulate the expression of the transgene sequences from the effect of regulatory sequences 
surrounding the site of integration of the transgene in the genome. Accordingly, as detailed 
below, in preferred embodiments, the transgene is inserted in an entire genomic locus or
significant portion thereof.

To manipulate large genomic sequences contained in, for example, a BAC,
nucleotide sequences coding for the heterologous protein to be expressed and/or other
regulatory elements may be inserted into the BAC by directed homologous recombination in
bacteria, e.g., the methods of Heintz WO 98/59060; Heintz et al., WO 01/05962; Yang et
al., 1997, Nature Biotechnol. 15: 859-865; Yang et al., 1999, Nature Genetics 22: 327-35;
which are incorporated herein by reference in their entireties.

Alternatively, the BAC can also be engineered or modified by "E-T cloning," as 
described by Muyrers et al. (1999, Nucleic Acids Res. 27(6): 1555-57, incorporated herein 
by reference in its entirety). Using these methods, specific DNA may be engineered into a
BAC independently of the presence of suitable restriction sites. This method is based on
homologous recombination mediated by the recE and recT proteins ("ET-cloning") (Zhang
et al., 1998, Nat. Genet. 20(2): 123-28; incorporated herein by reference in its entirety).
Homologous recombination can be performed between a PCR fragment flanked by short
homology arms and an endogenous intact recipient such as a BAC. Using this method,
homologous recombination is not limited by the disposition of restriction endonuclease
cleavage sites or the size of the target DNA. A BAC can be modified in its host strain using
a plasmid, e.g., pBAD-αβγ, in which recE and recT have been replaced by their respective
functional counterparts of phage lambda (Muyrers et al., 1999, Nucleic Acids Res. 27(6):
1555-57). Preferably, a BAC is modified by recombination with a PCR product containing
homology arms ranging from 27-60 bp. In a specific embodiment, homology arms are 50 
bp in length. 
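
The homology-arm design described above can be sketched in Python as follows. This is a minimal illustration, not the procedure of the cited references: the function name, the 20-nucleotide priming length, and the simple insertion-at-a-point model (no deletion of target sequence) are assumptions made for the example. Each primer carries a homology arm at its 5' end and a cassette-priming segment at its 3' end, and the arm length is restricted to the 27-60 bp range stated in the text, with 50 bp as the default.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    complement = {"A": "T", "T": "A", "G": "C", "C": "G"}
    return "".join(complement[base] for base in reversed(seq.upper()))


def design_recombination_primers(target, insert_pos, cassette,
                                 arm_len=50, priming_len=20):
    """Design PCR primers that add homology arms to a cassette.

    target      -- sequence of the recipient (e.g. a region of a BAC)
    insert_pos  -- 0-based position in target where the cassette is inserted
    cassette    -- sequence to be amplified and inserted
    arm_len     -- homology arm length in bp (27-60, per the text)
    priming_len -- hypothetical length of the cassette-annealing 3' segment
    """
    if not 27 <= arm_len <= 60:
        raise ValueError("arm length outside the 27-60 bp range")
    if insert_pos < arm_len or insert_pos + arm_len > len(target):
        raise ValueError("not enough flanking sequence for the homology arms")
    if priming_len > len(cassette):
        raise ValueError("cassette shorter than the priming segment")

    target, cassette = target.upper(), cassette.upper()
    left_arm = target[insert_pos - arm_len:insert_pos]
    right_arm = target[insert_pos:insert_pos + arm_len]

    # Forward primer: left arm (5') followed by the start of the cassette (3').
    forward = left_arm + cassette[:priming_len]
    # Reverse primer: reverse complement of (cassette end + right arm), which
    # places the right arm at the 5' end and the cassette-annealing part at 3'.
    reverse = reverse_complement(cassette[-priming_len:] + right_arm)
    return forward, reverse


if __name__ == "__main__":
    # Made-up sequences, for illustration only.
    toy_target = "A" * 60 + "C" * 60
    toy_cassette = "ATGC" * 10
    fwd, rev = design_recombination_primers(toy_target, 60, toy_cassette)
    print(len(fwd), fwd)
    print(len(rev), rev)
```

With the default settings each primer is 70 nucleotides long: 50 nucleotides of homology to the recipient plus 20 nucleotides that anneal to the cassette.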

In another embodiment, a transgene is inserted into a yeast artificial chromosome
(YAC) (Burke et al., 1987, Science 236: 806-12; and Peterson et al., 1997, Trends Genet.
13:61, both of which are incorporated by reference herein in their entireties).

In other embodiments, the transgene is inserted into another vector developed for the 
cloning of large segments of genomic DNA, such as a cosmid or bacteriophage P1
(Sternberg et al., 1990, Proc. Natl. Acad. Sci. USA 87: 103-07). The approximate
maximum insert size is 30-35 kb for cosmids and 100 kb for bacteriophage P1. In another
embodiment, the transgene is inserted into a P1-derived artificial chromosome (PAC)
(Mejia et al., 1997, Genome Res. 7:179-186). The maximum insert size is 300 kb.
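
The insert-size limits quoted above lend themselves to a simple lookup. The following Python sketch, with a hypothetical function name, lists which of these three vector types could hold an insert of a given size; the capacities are the approximate figures from the text (taking the upper value of 35 kb for cosmids), and other vector types mentioned elsewhere are omitted because no limits are stated for them here.

```python
# Approximate maximum insert sizes in kb, as quoted in the text above.
MAX_INSERT_KB = {
    "cosmid": 35,
    "bacteriophage P1": 100,
    "PAC": 300,
}


def vectors_for_insert(insert_kb):
    """Return the vector types whose quoted capacity covers insert_kb."""
    return [name for name, capacity in MAX_INSERT_KB.items()
            if insert_kb <= capacity]


if __name__ == "__main__":
    for size in (30, 80, 250, 400):
        print(size, "kb:", vectors_for_insert(size))
```

An empty list indicates that the insert exceeds all three quoted limits, in which case a different vector system would have to be considered.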

Vectors containing the appropriate heterologous sequences may be identified by any 
method well known in the art, for example, by sequencing, restriction mapping,
hybridization, PCR amplification, etc. 

The vectors of the invention comprise one or more nucleotide sequences encoding a
heterologous protein desired to be expressed in the transgenic avian, as well as regulatory
elements such as promoters, enhancers, MARs, IRES's and other translation control
elements, transcriptional termination elements, polyadenylation sequences, etc., as discussed
infra. In particular embodiments, the vector of the invention contains at least two
nucleotide sequences coding for heterologous proteins, for example, but not limited to, the
heavy and light chains of an immunoglobulin.

In a preferred embodiment, the nucleotide sequence encoding the heterologous 
protein is inserted into all or a significant portion of a nucleic acid containing the genomic 
sequence of an endogenous avian gene, preferably an avian gene that is expressed in the
magnum, e.g., ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin, and
ovomucin, etc. For example, the heterologous gene sequence may be inserted into or
replace a portion of the 3' untranslated region (UTR) or 5' untranslated region (UTR) or an
intron sequence of the endogenous gene genomic sequence. Preferably, the heterologous
gene coding sequence has its own IRES. For descriptions of IRES's, see, e.g., Jackson et
al., 1990, Trends Biochem. Sci. 15(12):477-83; Jang et al., 1988, J. Virol. 62(8):2636-43;
Jang et al., 1990, Enzyme 44(1-4):292-309; and Martinez-Salas, 1999, Curr. Opin.
Biotechnol. 10(5):458-64; Palmenberg et al., United States Patent No. 4,937,190, which are
incorporated by reference herein in their entireties. In another embodiment, the
heterologous protein coding sequence is inserted at the 3' end of the endogenous gene
coding sequence. In another preferred embodiment, the heterologous gene coding
sequences are inserted using 5' direct fusion wherein the heterologous gene coding
sequences are inserted in-frame adjacent to the initial ATG sequence (or adjacent the
nucleotide sequence encoding the first two, three, four, five, six, seven or eight amino acids)
of the endogenous gene or replacing some or all of the sequence of the endogenous gene
coding sequence. In yet another specific embodiment, the heterologous gene coding
sequence is inserted into a separate cistron in the 5' region of the endogenous gene genomic
sequence and has an independent IRES sequence.

The present invention further relates to nucleic acid vectors (preferably, not derived
from eukaryotic viruses, except, in certain embodiments, for eukaryotic viral promoters and/or
enhancers) and transgenes inserted therein that incorporate multiple polypeptide-encoding
regions, wherein a first polypeptide-encoding region is operatively linked to a
transcription promoter and a second polypeptide-encoding region is operatively linked to an
IRES. For example, the vector may contain coding sequences for two different
heterologous proteins (e.g., the heavy and light chains of an immunoglobulin) or the coding
sequences for all or a significant part of the genomic sequence for the gene from which the
promoter driving expression of the transgene is derived, and the heterologous protein
desired to be expressed (e.g., a construct containing the genomic coding sequences,
including introns, of the avian lysozyme gene when the avian lysozyme promoter is used to
drive expression of the transgene, an IRES, and the coding sequence for the heterologous
protein desired to be expressed downstream (i.e., 3' on the RNA transcript) of the IRES).
Thus, in certain embodiments, the nucleic acid encoding the heterologous protein is 
introduced into the 5' untranslated or 3' untranslated regions of an endogenous gene, such as 
but not limited to, ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin, and 
ovomucin, with an IRES sequence directing translation of the heterologous sequence.

Such nucleic acid constructs, when inserted into the genome of a bird and expressed 
therein, will generate individual polypeptides that may be post-translationally modified, for 
example, glycosylated or, in certain embodiments, form complexes, such as heterodimers 
with each other in the white of the avian egg. Alternatively, the expressed polypeptides may 
be isolated from an avian egg and combined in vitro, or expressed in a non-reproductive
tissue such as serum. In other embodiments, for example, but not limited to, when 
expression of both heavy and light chains of an antibody is desired, two separate constructs, 
each containing a coding sequence for one of the heterologous proteins operably linked to a 
promoter (either the same or different promoters), are introduced by microinjection into 
cytoplasm of one or more embryonic cells and transgenic avians harboring both transgenes
in their genomes and expressing both heterologous proteins are identified. Alternatively,
two transgenic avians each containing one of the two heterologous proteins (e.g., one
transgenic avian having a transgene encoding the light chain of an antibody and a second
transgenic avian having a transgene encoding the heavy chain of the antibody) can be bred
to obtain an avian containing both transgenes in its germline and expressing both transgene
encoded proteins, preferably in eggs.

Recombinant expression vectors can be designed for the expression of the encoded 
proteins in eukaryotic cells. Useful vectors may comprise constitutive or inducible 
promoters to direct expression of either fusion or non-fusion proteins. With fusion vectors,
a number of amino acids are usually added to the expressed target gene sequence such as,
but not limited to, a protein sequence for thioredoxin, a polyhistidine, or any other amino
acid sequence that facilitates purification of the expressed protein. A proteolytic cleavage
site may further be introduced at a site between the target recombinant protein and the
fusion sequence. Additionally, a region of amino acids such as a polymeric histidine region
may be introduced to allow binding of the fusion protein to metallic ions such as nickel
bonded to a solid support, and thereby allow purification of the fusion protein. Once the
fusion protein has been purified, the cleavage site allows the target recombinant protein to
be separated from the fusion sequence. Enzymes suitable for use in cleaving the proteolytic
cleavage site include, but are not limited to, Factor Xa and thrombin. Fusion expression
vectors that may be useful in the present invention include pGex (AMRAD® Corp.,
Melbourne, Australia), pRIT5 (PHARMACIA®, Piscataway, NJ) and pMAL (NEW
ENGLAND BIOLABS®, Beverly, MA), fusing glutathione S-transferase, protein A, or 
maltose E binding protein, respectively, to the target recombinant protein. 

Once a promoter and a nucleic acid encoding a heterologous protein of the present 
invention have been cloned into a vector system, it is ready to be incorporated into a host
cell. Such incorporation can be carried out by the various forms of transformation noted 
above, depending upon the vector/host cell system. It is contemplated that the incorporation 
of the DNA of the present invention into a recipient cell may be by any suitable method 
such as, but not limited to, viral transfer, electroporation, gene gun insertion, sperm-
mediated transfer to an ovum, microinjection and the like. Suitable host cells include, but
are not limited to, bacteria, virus, yeast, mammalian cells, and the like. In particular, the 
present invention contemplates the use of recipient avian cells, such as chicken cells or 
quail cells. 

Another aspect of the present invention, therefore, is a method of expressing a
heterologous polypeptide in a eukaryotic cell by transfecting an avian cell with a
recombinant DNA comprising an avian tissue-specific promoter operably linked to a nucleic
acid insert encoding a polypeptide and, optionally, a polyadenylation signal sequence, and
culturing the transfected cell in a medium suitable for expression of the heterologous
polypeptide under the control of the avian lysozyme gene expression control region.

Yet another aspect of the present invention is a eukaryotic cell transformed with an
expression vector according to the present invention and described above. In one
embodiment of the present invention, the transformed cell is a chicken oviduct cell and the
nucleic acid insert comprises the chicken lysozyme gene expression control region, a
nucleic acid insert encoding a human interferon α2b and codon optimized for expression in
an avian cell, and an SV40 polyadenylation sequence.

In another embodiment, the transformed cell is a quail oviduct cell and the nucleic
acid insert comprises the artificial avian promoter construct MDOT (SEQ ID NO.: 11)
operably linked to an interferon-encoding sequence, as described in Example 23 below.

In yet another embodiment of the present invention, a quail oviduct cell is
transfected with the nucleic acid insert comprising the MDOT artificial promoter construct
operably linked to an erythropoietin (EPO)-encoding nucleic acid, wherein the transfected
quail produces heterologous erythropoietin.






WO 03/0241W PCT/US02/3015f. 



5.2.1 PROMOTERS 

The vectors of the invention contain promoters that function in avian cells, 
preferably, that are tissue-specific and, in preferred embodiments, direct expression in the 
magnum or serum or other tissue such that expressed proteins are deposited in eggs, more 
5 preferably, that are specific for expression in the magnum. Alternatively, the promoter 
directs expression of the protein in the serum of the transgenic avian. Introduction of the 
vectors of the invention, preferably, generate transgenics that express the heterologous 
protein in tubular gland cells where it is secreted into the oviduct lumen and deposited, e.g. , 
into the white of an egg. In preferred embodiments, the promoter directs a level of 
10 expression of the heterologous protein in the egg white of eggs laid by GO and/or Gl chicks 
and/or their progeny that is greater than 5 ug, 10 ug, 50 ug, 100 ug, 250 ug, 500 ug or 750 
ug, more preferably greater than 1 mg, 2 mg, 5 mg,.10 mg, 20 mg, 50 mg, 100 mg, 200 mg, 
500 mg, 700 mg, 1 gram, 2 grams, 3 grams, 4 grams or 5 grams. Such levels of expression 
can be obtained using the promoters of the invention. 

In preferred embodiments, the promoters of the invention are derived from genes 
that express proteins present in significant levels in the egg white and/or the serum. For 
example, the promoter comprises regions of an ovalbumin, lysozyme, ovomucoid, 
ovotransferrin, conalbumin or ovomucin promoter or any other promoter that directs 
expression of a gene in an avian, particularly in a specific tissue of interest, such as the 
magnum or in the serum. Alternatively, the promoter used in the expression vector may be 
derived from that of the lysozyme gene that is expressed in both the oviduct and 
macrophages. Portions of two or more of these, and other promoters that function in avians, 
may be combined to produce an effective synthetic promoter. 

The promoter may optionally be a segment of the ovalbumin promoter region that is 
sufficiently large to direct expression of the coding sequence in the tubular gland cells. 
Other exemplary promoters include the promoter regions of the ovalbumin, lysozyme, 
ovomucoid, conalbumin, ovotransferrin or ovomucin genes (for example, but not limited to, 
as disclosed in co-pending United States Patent Application Nos. 09/922,549, filed August 
3, 2001 and 10/114,739, filed April 1, 2002, both entitled "Avian Lysozyme Promoter", by 
Rapp, and United States Patent Application No. 09/998,716, filed November 30, 2001, 
entitled "Ovomucoid Promoter and Methods of Use," by Harvey et al., all of which are 
incorporated by reference herein in their entireties). Alternatively, the promoter may be a 
promoter that is largely, but not entirely, specific to the magnum, such as the lysozyme 
promoter. Other suitable promoters may be artificial constructs such as a combination of 
nucleic acid regions derived from at least two avian gene promoters. One such embodiment 
of the present invention is the MDOT construct (SEQ ID NO: 11) comprising regions 
derived from the chicken ovomucin and ovotransferrin promoters, including but not limited 
to promoters altered, e.g., to increase expression, and inducible promoters, e.g., the tet 
system. 

The ovalbumin gene encodes a 45 kD protein that is also specifically expressed in 
the tubular gland cells of the magnum of the oviduct (Beato, 1989, Cell 56:335-344). 
Ovalbumin is the most abundant egg white protein, comprising over 50 percent of the total 
protein produced by the tubular gland cells, or about 4 grams of protein per large Grade A 
egg (Gilbert, "Egg albumen and its formation" in Physiology and Biochemistry of the 
Domestic Fowl, Bell and Freeman, eds., Academic Press, London, New York, pp. 1291- 
1329). The ovalbumin gene and over 20 kb of each flanking region have been cloned and 
analyzed (Lai et al., 1978, Proc. Natl. Acad. Sci. USA 75:2205-2209; Gannon et al., 1979, 
Nature 278:428-424; Roop et al., 1980, Cell 19:63-68; and Royal et al., 1975, Nature 
279:125-132). 

The ovalbumin gene responds to steroid hormones such as estrogen, glucocorticoids, 
and progesterone, which induce the accumulation of about 70,000 ovalbumin mRNA 
transcripts per tubular gland cell in immature chicks and 100,000 ovalbumin mRNA 
transcripts per tubular gland cell in the mature laying hen (Palmiter, 1973, J. Biol. Chem. 
248:8260-8270; Palmiter, 1975, Cell 4:189-197). The 5' flanking region contains four 
DNAse I-hypersensitive sites centered at -0.25, -0.8, -3.2, and -6.0 kb from the transcription 
start site. These sites are called HS-I, -II, -III and -IV, respectively. Promoters of the 
invention may contain one, all, or a combination of HS-I, HS-II, HS-III and HS-IV. 
Hypersensitivity of HS-II and -III is estrogen-induced, supporting a role for these regions 
in hormone-induction of ovalbumin gene expression. 
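
By way of illustration only, the positions of the four hypersensitive sites given above can be tabulated and queried to see which sites would fall within a 5' flanking fragment of a given length. The following Python sketch is not part of the disclosed methods; it assumes that each site can be treated as a single point at its stated center and that the fragment runs contiguously upstream from the transcription start site. The names `HS_SITES_KB` and `sites_in_fragment` are hypothetical.

```python
# Approximate centers of the DNase I-hypersensitive sites, in kb relative to
# the transcription start site (values as stated in the text).
HS_SITES_KB = {"HS-I": -0.25, "HS-II": -0.8, "HS-III": -3.2, "HS-IV": -6.0}


def sites_in_fragment(upstream_kb):
    """Return the HS sites whose centers lie within a 5' fragment that
    extends `upstream_kb` kilobases upstream of the start site.

    Assumption: a site counts as included if its center point is inside
    the fragment; the true extent of each site is not modeled.
    """
    if upstream_kb < 0:
        raise ValueError("upstream_kb must be non-negative")
    return [name for name, pos in HS_SITES_KB.items() if -upstream_kb <= pos <= 0]


# A 1.4 kb fragment of the 5' region spans the centers of HS-I and HS-II only.
print(sites_in_fragment(1.4))
```

Under these assumptions a 1.4 kb fragment contains HS-I and HS-II but not HS-III or HS-IV, whereas a fragment of 6.0 kb or more would be needed to reach the center of HS-IV.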

HS-I and HS-II are both required for steroid induction of ovalbumin gene 
transcription, and a 1.4 kb portion of the 5' region that includes these elements is sufficient 
to drive steroid-dependent ovalbumin expression in explanted tubular gland cells (Sanders 
and McKnight, 1988, Biochemistry 27: 6550-6557). HS-I is termed the negative-response 
element ("NRE") because it contains several negative regulatory elements which repress 
ovalbumin expression in the absence of hormone (Haekers et al., 1995, Mol. Endo. 9:1113- 
1126). Protein factors bind these elements, including some factors only found in oviduct 
nuclei, suggesting a role in tissue-specific expression. HS-II is termed the steroid-dependent 
response element ("SDRE") because it is required to promote steroid induction of 
transcription. It binds a protein or protein complex known as Chirp-I. Chirp-I is induced by 
estrogen and turns over rapidly in the presence of cycloheximide (Dean et al., 1996, Mol. 
Cell. Biol. 16:2015-2024). Experiments using an explanted tubular gland cell culture 
system defined an additional set of factors that bind SDRE in a steroid-dependent manner, 
including a NFκB-like factor (Nordstrom et al., 1993, J. Biol. Chem. 268:13193-13202; 
Schweers and Sanders, 1991, J. Biol. Chem. 266: 10490-10497). 

Less is known about the function of HS-III and HS-IV. HS-III contains a functional 
estrogen response element, and confers estrogen inducibility to either the ovalbumin 
proximal promoter or a heterologous promoter when co-transfected into HeLa cells with an 
estrogen receptor cDNA. These data imply that HS-III may play a functional role in the 
overall regulation of the ovalbumin gene. Little is known about the function of HS-IV, 
except that it does not contain a functional estrogen-response element (Kato et al., 1992, 
Cell 68: 731-742). 

In an alternative embodiment of the invention, transgenes containing constitutive 
promoters are used, but the transgenes are engineered so that expression of the transgene 
effectively becomes magnum-specific. Thus, a method for producing an exogenous protein 
in an avian oviduct provided by the present invention involves generating a transgenic avian 
having two transgenes in its tubular gland cells. One transgene comprises a first coding 
sequence operably linked to a constitutive promoter. The second transgene comprises a 
second coding sequence that is operably linked to a magnum-specific promoter, where 
expression of the first coding sequence is either directly or indirectly dependent upon the 
cellular presence of the protein expressed by the second coding sequence. 
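
The dependency between the two transgenes described above can be pictured as a simple logical gate. The following Python sketch is illustrative only and is not a model of any particular construct: it assumes a direct, all-or-nothing dependence of the first coding sequence on the protein produced by the second, and the function name and tissue labels are hypothetical.

```python
def first_transgene_expressed(tissue, magnum_specific_tissues=("magnum",)):
    """Return True if the first coding sequence would be expressed in `tissue`.

    Simplifying assumptions (not taken from the specification):
    - the magnum-specific promoter is active only in the listed tissues;
    - the constitutive promoter is active in every tissue;
    - the first coding sequence is expressed only when the protein encoded
      by the second coding sequence is present in the same cell.
    """
    # Second transgene: regulatory protein under a magnum-specific promoter.
    regulator_present = tissue in magnum_specific_tissues
    # First transgene: constitutive promoter, gated by the regulatory protein.
    constitutive_promoter_active = True
    return constitutive_promoter_active and regulator_present


for tissue in ("magnum", "liver", "muscle"):
    print(tissue, first_transgene_expressed(tissue))
```

In this simplified picture the constitutively driven coding sequence is effectively restricted to the magnum, because only there is the second gene product present.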

Additional promoters useful in the present invention include inducible promoters, 
such as the tet operator and the metallothionein promoter which can be induced by 
treatment with tetracycline and zinc ions, respectively (Gossen et al., 1992, Proc. Natl. 
Acad. Sci. 89: 5547-5551 and Walden et al., 1987, Gene 61: 317-327; incorporated herein 
by reference in their entireties). 

5.2.1.1 CHICKEN LYSOZYME GENE EXPRESSION CONTROL REGION NUCLEIC ACID SEQUENCES 

The chicken lysozyme gene is highly expressed in the myeloid lineage of 
hematopoietic cells, and in the tubular glands of the mature hen oviduct (Hauser et al., 
1981, Hematol. and Blood Transfusion 26: 175-178; Schutz et al., 1978, Cold Spring 
Harbor Symp. Quart. Biol. 42: 617-624) and is therefore a suitable candidate for an efficient 
promoter for heterologous protein production in transgenic animals. The regulatory region 
of the lysozyme locus extends over at least 12 kb of DNA 5' upstream of the transcription 
start site, and comprises a number of elements that have been individually isolated and 
characterized. The known elements include three enhancer sequences at about -6.1 kb, -3.9 
kb, and -2.7 kb (Grewal et al., 1992, Mol. Cell Biol. 12: 2339-2350; Bonifer et al., 1996, J. 
Mol. Med. 74: 663-671), a hormone responsive element (Hecht et al., 1988, E.M.B.O.J. 7: 
2063-2073), a silencer element and a complex proximal promoter. The constituent 
elements of the lysozyme gene expression control region are identifiable as DNAase I 
hypersensitive chromatin sites (DHS). They may be differentially exposed to nuclease 
digestion depending upon the differentiation stage of the cell. For example, in the 
multipotent progenitor stage of myelomonocytic cell development, or in erythroblasts, the 
silencer element is a DHS. At the myeloblast stage, a transcription enhancer located -6.1 
kb upstream from the gene transcription start site is a DHS, while at the later monocytic 
stage another enhancer, at -2.7 kb, becomes DNAase sensitive (Huber et al., 1995, DNA and 
Cell Biol. 14: 397-402). 

This invention also envisions the use of promoters other than the lysozyme 
promoter, including but not limited to, a cytomegalovirus promoter, an ovomucoid, 
conalbumin or ovotransferrin promoter or any other promoter that directs expression of a 
gene in an avian, particularly in a specific tissue of interest, such as the magnum. 

Another aspect of the methods of the present invention is the use of combinational 
promoters comprising an artificial nucleic acid construct having at least two regions 
wherein the regions are derived from at least two gene promoters, including but not limited 
to a lysozyme, ovomucoid, conalbumin or ovotransferrin promoter. In one embodiment of 
the present invention, the promoter may comprise a region of an avian ovomucoid promoter 
and a region of an avian ovotransferrin promoter, thereby generating the MDOT avian 
artificial promoter construct. The avian MDOT promoter construct of the present invention 
has the nucleic acid sequence SEQ ID NO: 11 and is illustrated in Figure 7. This promoter 
is useful for allowing expression of a heterologous protein in chicken oviduct cells and may 
be operably linked to any nucleic acid encoding a heterologous polypeptide of interest 
including, for example, a cytokine, growth hormone, growth factor, enzyme, structural 
protein or the like. 


5.2.2 MATRIX ATTACHMENT REGIONS 

In preferred embodiments of the invention, the vectors contain matrix attachment 
regions (MARs) that preferably flank the transgene sequences to reduce position effects on 
expression when integrated into the avian genome. In fact, 5' MARs and 3' MARs (also 
referred to as "scaffold attachment regions" or SARs) have been identified in the outer 
boundaries of the chicken lysozyme locus (Phi-Van et al., 1988, E.M.B.O.J. 7: 655-664; 
Phi-Van, L. and Startling, W.H., 1996, Biochem. 35: 10735-10742). Deletion of a 1.32 kb 
or a 1.45 kb half region, each comprising half of a 5' MAR, reduces positional variation 
in the level of transgene expression (Phi-Van and Startling, supra). 

The 5' matrix-associated region (5' MAR), located about -11.7 kb upstream of the 
chicken lysozyme transcription start site, can increase the level of gene expression by 
limiting the positional effects exerted against a transgene (Phi-Van et al., 1988, supra). At 
least one other MAR is located 3' downstream of the protein encoding region. Although 
MAR nucleic acid sequences are conserved, little cross-hybridization is seen, indicating 
significant overall sequence variation. However, MARs of different species can interact 
with the nucleomatrices of heterologous species, to the extent that the chicken lysozyme 
MAR can associate with the tobacco plant nucleomatrix as well as that of the chicken 
oviduct cells (Mlynarona et al., 1994, Cell 6: 417-426; von Kries et al., 1990, Nucleic Acids 
Res. 18: 3881-3885). 

Gene expression must be considered not only from the perspective of cis-regulatory 
elements associated with a gene, and their interactions with trans-acting elements, but also 
with regard to the genetic environment in which they are located. Chromosomal positioning 
effects (CPEs), therefore, are the variations in levels of transgene expression associated with 
different locations of the transgene within the recipient genome. An important factor 
governing CPE upon the level of transgene expression is the chromatin structure around a 
transgene, and how it cooperates with the cis-regulatory elements. The cis-elements of the 
lysozyme locus are confined within a single chromatin domain (Bonifer et al., 1996, supra; 
Sippel et al., pgs. 133-147 in Eckstein F. & Lilley D.M.J. (eds), "Nucleic Acids and 
Molecular Biology", Vol. 3, 1989, Springer). 

The lysozyme promoter region of chicken is active when transfected into mouse 
fibroblast cells and linked to a reporter gene such as the bacterial chloramphenicol 
acetyltransferase (CAT) gene. The promoter element is also effective when transiently 
transfected into chicken promacrophage cells. In each case, however, the presence of a 5' 
MAR element increased positional independency of the level of transcription (Stief et al., 
1989, Nature 341: 343-345; Sippel et al., pgs. 257-265 in Houdebine L.M. (ed), 
"Transgenic Animals: Generation and Use"). 

The ability to direct the insertion of a transgene into a site in the genome of an 
animal where the positional effect is limited offers predictability of results during the 
development of a desired transgenic animal, and increased yields of the expressed product. 
Sippel and Steif disclose, in U.S. Patent No. 5,731,178, which is incorporated by reference 
herein in its entirety, methods to increase the expression of genes introduced into eukaryotic 
cells by flanking a transcription unit with scaffold attachment elements, in particular the 5' 
MAR isolated from the chicken lysozyme gene. The transcription unit disclosed by Sippel 
and Steif was an artificial construct that combined only the -6.1 kb enhancer element and 
the proximal promoter element (base position -579 to +15) from the lysozyme gene. Other 
promoter associated elements were not included. However, although individual 
cis-regulatory elements have been isolated and sequenced, together with short regions of 
flanking DNA, the entire nucleic acid sequence comprising the functional 5' upstream region 
of the lysozyme gene has not been determined in its entirety and therefore has not been 
employed as a functional promoter to allow expression of a heterologous transgene. 

Accordingly, vectors of the invention comprise MARs, preferably both 5' and 3' 
MARs that flank the transgene, including the heterologous protein coding sequences and the 
regulatory sequences. 

5.2.3 CODON-OPTIMIZED GENE EXPRESSION 

Another aspect of the present invention provides nucleic acid sequences encoding 
heterologous polypeptides that are codon-optimized for expression in avian cells, and 
derivatives and fragments thereof. When a heterologous nucleic acid is to be delivered to a 
recipient cell for expression therein, the sequence of the nucleic acid may be 
modified so that the codons are optimized for the codon usage of the recipient species. For 
example, if the heterologous nucleic acid is transfected into a recipient chicken cell, the 
sequence of the expressed nucleic acid insert is optimized for chicken codon usage. This 
may be determined from the codon usage of at least one, and preferably more than one, 
protein expressed in a chicken cell. For example, the codon usage may be determined from 
the nucleic acid sequences encoding the proteins ovalbumin, lysozyme, ovomucoid, 
ovotransferrin, conalbumin, and ovomucin of chicken. Briefly, the DNA sequence for the 
target protein may be optimized using the BACKTRANSLATE® program of the Wisconsin 
Package, version 9.1 (Genetics Computer Group, Inc., Madison, WI) with a codon usage 
table compiled from the chicken (Gallus gallus) ovalbumin, lysozyme, ovomucoid, 
ovotransferrin, conalbumin, and ovomucin proteins. The template and primer 
oligonucleotides are then amplified, by any means known in the art, including but not 
limited to PCR with Pfu polymerase (STRATAGENE®, La Jolla CA). 
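
The general idea of compiling a codon usage table from reference coding sequences and then back-translating a target protein can be sketched as follows. This Python example is illustrative only. It is not a reimplementation of the BACKTRANSLATE program, whose algorithm is not described here; it simply assumes that the most frequently observed codon is chosen for each amino acid, and it uses a short made-up reference sequence rather than the chicken gene sequences named above. All function and variable names are hypothetical.

```python
from collections import Counter, defaultdict

# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def codon_usage(coding_sequences):
    """Count codon occurrences per amino acid across in-frame coding sequences."""
    usage = defaultdict(Counter)
    for seq in coding_sequences:
        seq = seq.upper()
        # Walk the sequence codon by codon, ignoring any trailing partial codon.
        for i in range(0, len(seq) - len(seq) % 3, 3):
            codon = seq[i:i + 3]
            amino_acid = CODON_TABLE.get(codon)
            if amino_acid is not None:  # skip codons with ambiguous bases
                usage[amino_acid][codon] += 1
    return usage


def back_translate(protein, usage):
    """Back-translate a protein using the most frequent codon for each residue."""
    codons = []
    for residue in protein.upper():
        if residue not in usage:
            raise KeyError(f"no codon observed for residue {residue!r}")
        codons.append(usage[residue].most_common(1)[0][0])
    return "".join(codons)


# Made-up reference sequence (hypothetical, for demonstration only):
# ATG GCT GCC GCC AAG AAG AAA TAA
reference = ["ATGGCTGCCGCCAAGAAGAAATAA"]
table = codon_usage(reference)
print(back_translate("MAK", table))
```

In practice the reference list would hold the coding sequences of several highly expressed proteins of the recipient species, and a production tool would typically also handle ties, rare residues, and unwanted sequence motifs, none of which are addressed in this sketch.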

In one exemplary embodiment of a heterologous nucleic acid for use by the methods 
of the present invention, a nucleic acid insert encoding the human interferon α2b 
polypeptide optimized for codon-usage by the chicken is microinjected into the cytoplasm 
of a stage 1 embryo. Optimization of the sequence for codon usage is useful in elevating the 
level of translation in avian eggs. 

It is contemplated to be within the scope of the present invention for any nucleic 
acid encoding a polypeptide to be optimized for expression in avian cells. It is further 
contemplated that the codon usage may be optimized for a particular avian species used as a 
source of the host cells. In one embodiment of the present invention, the heterologous 
polypeptide is encoded using the codon-usage of a chicken. 

5.2.4 SPECIFIC VECTORS OF THE INVENTION 

In a preferred embodiment, a transgene of the invention comprises a chicken, or 
other avian, lysozyme control region sequence which directs expression of the coding 
sequence within the transgene. A series of PCR amplifications of template chicken 
genomic DNA are used to isolate the gene expression control region of the chicken 
lysozyme locus. Two amplification reactions used the PCR primer sets 5pLMAR2 (5'- 
TGCCGCCTTCTTTGATATTC-3') (SEQ ID NO: 1) and LE-6.1kbrevl (5'- 
TTGGTGGTAAGGCCTTTTTG-3') (SEQ ID NO: 2) (Set 1) and lys-6.1 (5'- 
CTGGCAAGCTGTCAAAAACA-3') (SEQ ID NO: 3) and LysElRev (5'- 
CAGCTCACATCGTCCAAAGA-3') (SEQ ID NO: 4) (Set 2). The amplified PCR 
products were united as a contiguous isolated nucleic acid by a third PCR amplification step 
with the primers SEQ ID NOS: 1 and 4. 
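
As an illustration of how the outer primer pair bounds the final product, the following Python sketch locates exact matches for a forward primer and for the reverse complement of a reverse primer on a template strand and reports the length of the bounded product. It is a minimal sketch under stated assumptions: only exact matches on a single linear template are considered, and the template used here is a short toy sequence constructed for the demonstration, not the chicken lysozyme locus. The function names are hypothetical; the two primer sequences are those given above for SEQ ID NOS: 1 and 4.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def amplicon_length(template, forward_primer, reverse_primer):
    """Return the length of the product bounded by the two primers,
    or None if either primer has no exact match on the template.

    Assumption: the template is given as the top strand, 5' to 3', so the
    reverse primer anneals where its reverse complement appears.
    """
    template = template.upper()
    start = template.find(forward_primer.upper())
    if start == -1:
        return None
    reverse_site = reverse_complement(reverse_primer)
    end = template.find(reverse_site, start + 1)
    if end == -1:
        return None
    return end + len(reverse_site) - start


FORWARD = "TGCCGCCTTCTTTGATATTC"  # 5pLMAR2, SEQ ID NO: 1
REVERSE = "CAGCTCACATCGTCCAAAGA"  # LysElRev, SEQ ID NO: 4

# Toy template (hypothetical): forward primer, 40 nt of filler, reverse site.
toy_template = FORWARD + "ACGT" * 10 + reverse_complement(REVERSE)
print(amplicon_length(toy_template, FORWARD, REVERSE))
```

Applied to a full-length template sequence, the same check would give the expected size of the contiguous product generated in the third amplification step, provided both primers match the template exactly.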

The isolated PCR-amplified product, comprising about 12 kb of the nucleic acid 
region 5' upstream of the native chicken lysozyme gene locus, was cloned into the plasmid 
pCMV-LysSPIFNMM. pCMV-LysSPIFNMM comprises a modified nucleic acid insert 
encoding a human interferon α2b sequence and an SV40 polyadenylation signal sequence 
(SEQ ID NO: 8) 3' downstream of the interferon encoding nucleic acid. The sequence SEQ 
ID NO: 5 of the nucleic acid insert encoding human interferon α2b was in accordance with 
avian cell codon usage, as determined from the nucleotide sequences encoding chicken 
ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin, and ovomucin. 

The nucleic acid sequence (SEQ ID NO: 6) (GenBank Accession No. AF405538) of 
the insert in pAVIJCR-A115.93.1.2 is shown in Figures 1A-E. The modified human 
interferon α2b encoding nucleotide sequence SEQ ID NO: 5 (GenBank Accession No. 
AF405539) and the novel chicken lysozyme gene expression control region SEQ ID NO: 7 
(GenBank Accession No. AF405540) are shown in Figures 2 and 3A-E, respectively. A 
polyadenylation signal sequence that is suitable for operably linking to the 
polypeptide-encoding nucleic acid insert is the SV40 signal sequence SEQ ID NO: 8, as 
shown in Figure 4. 

The plasmid pAVIJCR-A115.93.1.2 was restriction digested with enzyme FseI to 
isolate a 15.4 kb DNA containing the lysozyme 5' matrix attachment region (MAR) and the 
-12.0 kb lysozyme promoter driving the expression of the interferon-encoding insert, as 
described in Example 17, below. Plasmid pIIIilys was restriction digested with MluI and 
XhoI to isolate an approximately 6 kb nucleic acid, comprising the 3' lysozyme domain, the 
sequence of which (SEQ ID NO: 9) is shown in Figures 5A-C. The 15.4 kb and 6 kb 
nucleic acids were ligated and the 21.4 kb nucleic acid comprising the nucleic acid sequence 
SEQ ID NO: 10 as shown in Figures 6A-J was transformed into recipient STBL4 cells. 

The inclusion of the novel isolated avian lysozyme gene expression control region of 
the present invention upstream of a codon-optimized interferon-encoding sequence in 
pAVIJCR-A115.93.1.2 allowed expression of the interferon polypeptide in avian cells 
transfected by sperm-mediated transfection. The 3' lysozyme domain SEQ ID NO: 9, when 
operably linked downstream of a heterologous nucleic acid insert, also allows expression of 
the nucleic acid insert as described in Example 18, below. For example, the nucleic acid 
insert may encode a heterologous polypeptide such as the α2b interferon encoded by the 
sequence SEQ ID NO: 5. 

It is further contemplated that any nucleic acid sequence encoding a polypeptide may 
be operably linked to the novel isolated avian lysozyme gene expression control region 
(SEQ ID NO: 7) and optionally operably linked to the 3' lysozyme domain SEQ ID NO: 9 so 
as to be expressed in a transfected avian cell. The plasmid construct pAVIJCR-A115.93.1.2 
can be introduced into cultured quail oviduct cells by transfection. ELISA assays of the 
culture media showed that the transfected cells synthesized a polypeptide detectable with 
anti-human interferon α2b antibodies. 

The isolated chicken lysozyme gene expression control region (SEQ ID NO: 7) for 
use in the methods of the present invention comprises the nucleotide elements that are 
positioned 5' upstream of the lysozyme-encoding region of the native chicken lysozyme 
locus and which are necessary for the regulated expression of a downstream 
polypeptide-encoding nucleic acid. While not wishing to be bound by any one theory, the 
inclusion of at least one 5' MAR sequence of or reference element in the isolated control 
region may confer positional independence to a transfected gene operably linked to the novel 
lysozyme gene expression control region. 

The isolated lysozyme gene expression control region (SEQ ID NO: 7) of the present 
invention is useful for reducing the chromosomal positional effect of a transgene operably 
linked to the lysozyme gene expression control region and transfected into a recipient avian 
cell. By isolating a region of the avian genome extending from a point 5' upstream of a 5' 
MAR of the lysozyme locus to the junction between the signal peptide sequence and a 
polypeptide-encoding region, cis-regulatory elements are also included that may allow gene 
expression in a tissue-specific manner. The lysozyme promoter region of the present 
invention, therefore, will allow expression of an operably linked heterologous nucleic acid 
insert in a transfected avian cell such as, for example, an oviduct cell. 

It is further contemplated that a recombinant DNA of the present invention may 
further comprise the chicken lysozyme 3' domain (SEQ ID NO: 9) linked downstream of 
the nucleic acid insert encoding a heterologous polypeptide. The lysozyme 3' domain (SEQ 
ID NO: 9) includes a nucleic acid sequence encoding a 3' MAR domain that may cooperate 
with a 5' MAR to direct the insertion of the construct of the present invention into the 
chromosome of a transgenic avian, or may act independently of the 5' MAR. 

Fragments of a nucleic acid encoding a portion of the subject lysozyme gene 
expression control region may also be useful as an autonomous gene regulatory element that 
may itself be operably linked to a polypeptide-encoding nucleic acid. Alternatively, the 
fragment may be combined with fragments derived from other gene promoters, such as an 
avian ovalbumin, ovomucoid, ovotransferrin, conalbumin or ovomucin promoter, thereby 
generating novel promoters having new properties or a combination of properties. As used 
herein, a fragment of the nucleic acid encoding an active portion of a lysozyme gene 
expression control region refers to a nucleotide sequence having fewer nucleotides than the 
nucleotide sequence encoding the entire nucleic acid sequence of the lysozyme gene 
expression control region, but at least 200 nucleotides. 
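
The length limits in the definition of a fragment given above can be expressed as a simple check. The following Python sketch is offered only as an illustration. The two length conditions come from the definition; the further requirement that the fragment be a contiguous stretch of the full-length region is an assumption made for this example, and the function name and test sequences are hypothetical.

```python
MIN_FRAGMENT_LENGTH = 200  # lower bound stated in the definition, in nucleotides


def is_control_region_fragment(candidate, full_region):
    """Return True if `candidate` satisfies the length limits of the definition.

    Conditions taken from the text: at least 200 nucleotides, and fewer
    nucleotides than the full-length control region.
    Added assumption: the candidate occurs as a contiguous subsequence of
    the full-length region.
    """
    candidate = candidate.upper()
    full_region = full_region.upper()
    within_limits = MIN_FRAGMENT_LENGTH <= len(candidate) < len(full_region)
    return within_limits and candidate in full_region


# Toy 400-nucleotide stand-in for a control region (not a real sequence).
toy_region = "ACGT" * 100
print(is_control_region_fragment(toy_region[50:300], toy_region))
print(is_control_region_fragment(toy_region[:100], toy_region))
```

With the toy sequence, the 250-nucleotide slice meets the definition while the 100-nucleotide slice does not; the full-length sequence itself is also excluded because a fragment must be shorter than the entire region.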

The present invention also contemplates the use of antisense nucleic acid molecules 
that are designed to be complementary to a coding strand of a nucleic acid (i.e., 
complementary to an endogenous DNA or an mRNA sequence) or, alternatively, 
complementary to a 5' or 3' untranslated region of the mRNA and therefore useful for 
regulating the expression of a gene by the lysozyme promoter. 

Synthesized oligonucleotides can be produced in variable lengths when, for example, 
non-naturally occurring polypeptide sequences are desired. The number of bases 
synthesized will depend upon a variety of factors, including the desired use for the probes or 
primers. Additionally, sense or anti-sense nucleic acids or oligonucleotides can be 
chemically synthesized using modified nucleotides to increase the biological stability of the 
molecule or of the binding complex formed between the anti-sense and sense nucleic acids. 
For example, acridine substituted nucleotides can be synthesized. Protocols for designing 
isolated nucleotides, nucleotide probes, and/or nucleotide primers are well-known to those 
of ordinary skill, and such oligonucleotides can be purchased commercially from a variety 
of sources (e.g., SIGMA GENOSYS®, The Woodlands, TX or The Great American Gene Co., 
Ramona, CA). 


5.2.5 RECOMBINANT EXPRESSION VECTORS 

A useful application of the novel promoters of the present invention, such as the 
avian lysozyme gene expression control region (SEQ ID NO: 7) or the MDOT promoter 
construct (SEQ ID NO: 11), is the possibility of increasing the amount of a heterologous 
protein present in a bird, especially a chicken, by gene transfer. In most instances, a 
heterologous polypeptide-encoding nucleic acid insert transferred into the recipient animal 
host will be operably linked with a gene expression control region to allow the cell to 
initiate and continue production of the genetic product protein. A recombinant DNA 
molecule of the present invention can be transferred into the extra-chromosomal or genomic 
DNA of the host. 

Expression of a foreign gene in an avian cell permits partial or complete 
post-translational modification such as, but not limited to, glycosylation, and/or the 
formation of the relevant inter- or intra-chain disulfide bonds. Examples of vectors useful for 
expression in the chicken Gallus gallus include pYepSecl (Baldari et al., 1987, E.M.B.O.J., 
6: 229-234; incorporated herein by reference in its entirety) and pYES2 (INVITROGEN® 
Corp., San Diego, CA). 

The present invention contemplates that the injected cell may transiently contain the 
injected DNA, whereby the recombinant DNA or expression vector may not be integrated 
into the genomic nucleic acid. It is further contemplated that the injected recombinant DNA 
or expression vector may be stably integrated into the genomic DNA of the recipient cell, 
thereby replicating with the cell so that each daughter cell receives a copy of the injected 
nucleic acid. It is still further contemplated for the scope of the present invention to include 
a transgenic animal producing a heterologous protein expressed from an injected nucleic 
acid according to the present invention. 

Heterologous nucleic acid molecules can be delivered to oocytes using the 
sperm-mediated transfection methods of the present invention. The nucleic acid molecule may 
be inserted into a cell to which the nucleic acid molecule (or promoter coding region) is 
heterologous (i.e., not normally present). Alternatively, the recombinant DNA molecule 
may be introduced into cells which normally contain the recombinant DNA molecule or the 
particular coding region, as, for example, to correct a deficiency in the expression of a 
polypeptide, or where over-expression of the polypeptide is desired. 

Another aspect of the present invention, therefore, is a method of expressing a 
heterologous polypeptide in an avian cell by transfecting the avian cell with a selected 
heterologous nucleic acid comprising an avian promoter operably linked to a nucleic acid 
insert encoding a polypeptide and, optionally, a polyadenylation signal sequence. The 
transfected cell, which may be an avian embryonic cell microinjected with a heterologous 
nucleic acid, will generate a transgenic embryo that after introduction into a recipient hen 
will be laid as a hard-shell egg and develop into a transgenic chick. 

In another embodiment of the present invention, the nucleic acid insert comprises 
the chicken lysozyme gene expression control region, a nucleic acid insert encoding a 
human interferon α2b and codon optimized for expression in an avian cell, and a chicken 3' 
domain, i.e., downstream enhancer elements. 

In one embodiment of the present invention, the transgenic animal is an avian 
selected from a turkey, duck, goose, quail, pheasant, ratite, an ornamental bird or a feral 
bird. In another embodiment, the avian is a chicken and the heterologous polypeptide 
produced under the transcriptional control of the avian promoter is produced in the white of 
an egg. In yet another embodiment of the present invention, the heterologous polypeptide is 
produced in the serum of a bird. 


5.3 HETEROLOGOUS PROTEINS PRODUCED BY TRANSGENIC AVIANS 

Methods of the present invention, providing for the production of heterologous 
protein in the avian oviduct (or other tissue leading to deposition of the protein into the egg) 
and the production of eggs containing heterologous protein, involve providing a suitable 
vector coding for the heterologous protein and introducing the vector into oocytes by 
sperm-mediated transfection such that the vector is integrated into the genome of the 
resulting transgenic embryo. A subsequent step involves deriving a mature transgenic avian 
from the transgenic embryo produced in the previous steps by transferring the injected cell 
or cells into the infundibulum of a recipient hen; producing a hard shell egg from that hen; 
and allowing the egg to develop and hatch to produce a transgenic bird. 

A transgenic avian so produced from transgenic embryonic cells is known as a 
founder. Such founders may be mosaic for the transgene (in certain embodiments, the 
founder has 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 90%, 100% of the cells containing 
the transgene). The invention further provides production of heterologous proteins in other 
tissues of the transgenic avians. Some founders will carry the transgene in the tubular gland 
cells in the magnum of their oviducts. These birds will express the exogenous protein 
encoded by the transgene in their oviducts. If the exogenous protein contains the 
appropriate signal sequences, it will be secreted into the lumen of the oviduct and into the 
white of an egg. 

Some founders are germ-line founders. A germ-line founder is a founder that carries 
the transgene in genetic material of its germ-line tissue, and may also carry the transgene in 
oviduct magnum tubular gland cells that express the exogenous protein. Therefore, in 
accordance with the invention, the transgenic bird may have tubular gland cells expressing 
the exogenous protein and the offspring of the transgenic bird will also have oviduct 
magnum tubular gland cells that express the exogenous protein. Alternatively, the offspring 
express a phenotype determined by expression of the exogenous gene in a specific tissue of 
the avian. In preferred embodiments, the heterologous proteins are produced from 
transgenic avians that were not (or the founder ancestors were not) produced using a 
eukaryotic viral vector, or a retroviral vector. 

The present invention can be used to express, in large yields and at low cost, a wide 
range of desired proteins including those used as human and animal pharmaceuticals, 
diagnostics, and livestock feed additives. Proteins such as growth hormones, cytokines, 
structural proteins and enzymes, including human growth hormone, interferon, lysozyme, 
and β-casein, are examples of proteins that are desirably expressed in the oviduct and 
deposited in eggs according to the invention. Other possible proteins to be produced 
include, but are not limited to, albumin, α-1 antitrypsin, antithrombin III, collagen, factors 
VIII, IX, X (and the like), fibrinogen, hyaluronic acid, insulin, lactoferrin, protein C, 
erythropoietin (EPO), granulocyte colony-stimulating factor (G-CSF), granulocyte 
macrophage colony-stimulating factor (GM-CSF), tissue-type plasminogen activator (tPA), 
feed additive enzymes, somatotropin, and chymotrypsin. Immunoglobulins and genetically 
engineered antibodies, including immunotoxins that bind to surface antigens on human 
tumor cells and destroy them, can also be expressed for use as pharmaceuticals or 
diagnostics. It is contemplated that immunoglobulin polypeptides expressed in avian cells 
following transfection by the methods of the present invention may include monomeric 
heavy and light chains, single-chain antibodies or multimeric immunoglobulins comprising 
variable heavy and light chain regions, i.e., antigen-binding domains, or intact heavy and 
light immunoglobulin chains. 

5.3.1 MULTIMERIC PROTEINS 

The invention, in preferred embodiments, provides methods for producing 
multimeric proteins, preferably immunoglobulins, such as antibodies, and antigen binding 
fragments thereof. 

In one embodiment of the present invention, the multimeric protein is an 
immunoglobulin, wherein the first and second heterologous polypeptides are 
immunoglobulin heavy and light chains, respectively. Illustrative examples of this and other 
aspects and embodiments of the present invention for the production of heterologous 
multimeric polypeptides in avian cells are fully disclosed in U.S. Patent Application No. 
09/877,374, filed June 8, 2001, by Rapp, which is incorporated herein by reference in its 
entirety. In one embodiment of the present invention, therefore, the multimeric protein is an 
immunoglobulin wherein the first and second heterologous polypeptides are an 
immunoglobulin heavy and light chain, respectively. Accordingly, the invention provides 
immunoglobulin and other multimeric proteins that have been produced by transgenic 
avians of the invention. 

In the various embodiments of this aspect of the present invention, an 
immunoglobulin polypeptide encoded by the transcriptional unit of at least one expression 
vector may be an immunoglobulin heavy chain polypeptide comprising a variable region or 
a variant thereof, and may further comprise a D region, a J region, a C region, or a 
combination thereof. An immunoglobulin polypeptide encoded by the transcriptional unit 
of an expression vector may also be an immunoglobulin light chain polypeptide comprising 
a variable region or a variant thereof, and may further comprise a J region and a C region. It 
is also contemplated to be within the scope of the present invention for the immunoglobulin 
regions to be derived from the same animal species, or a mixture of species including, but 
not limited to, human, mouse, rat, rabbit and chicken. In preferred embodiments, the 
antibodies are human or humanized. 

In other embodiments of the present invention, the immunoglobulin polypeptide 
encoded by the transcriptional unit of at least one expression vector comprises an 
immunoglobulin heavy chain variable region, an immunoglobulin light chain variable 
region, and a linker peptide, thereby forming a single-chain antibody capable of selectively 
binding an antigen. 

Another aspect of the present invention provides a method for the production in an 
avian of a heterologous protein capable of forming an antibody suitable for selectively 
binding an antigen, comprising the step of producing a transgenic avian incorporating at 
least one transgene, wherein the transgene encodes at least one heterologous polypeptide 
selected from an immunoglobulin heavy chain variable region, an immunoglobulin heavy 
chain comprising a variable region and a constant region, an immunoglobulin light chain 
variable region, an immunoglobulin light chain comprising a variable region and a constant 
region, and a single-chain antibody comprising two peptide-linked immunoglobulin variable 
regions. Preferably, the antibody is expressed such that it is deposited in the white of the 
developing eggs of the avian. The hard shell avian eggs thus produced can be harvested and 
the heterologous polypeptide capable of forming, or which has formed, an antibody can be 
isolated from the harvested egg. It is also understood that the heterologous polypeptides 
may be expressed under the transcriptional control of promoters that allow for release 
of the polypeptides into the serum of the transgenic animal. Exemplary promoters for 
non-tissue-specific production of a heterologous protein are the CMV promoter and the RSV 
promoter. 

In one embodiment of this method of the present invention, the transgene comprises 
a transcription unit encoding a first and a second immunoglobulin polypeptide operatively 
linked to a transcription promoter, a transcription terminator and, optionally, an internal 
ribosome entry site (IRES) (see, for example, U.S. Patent No. 4,937,190 to Palmenberg et 
al., the contents of which are incorporated herein by reference in their entirety). 

In an embodiment of this method of the present invention, the isolated heterologous 
protein is an antibody capable of selectively binding to an antigen. In this embodiment, the 
antibody may be generated within the serum of an avian or within the white of the avian egg 
by combining at least one immunoglobulin heavy chain variable region and at least one 
immunoglobulin light chain variable region, preferably cross-linked by at least one 
disulfide bridge. The combination of the two variable regions will generate a binding site 
capable of binding an antigen using methods for antibody reconstitution that are well known 
in the art. 

It is, however, contemplated to be within the scope of the present invention for 
immunoglobulin heavy and light chains, or variants or derivatives thereof, to be expressed 
in separate transgenic avians, and therefore isolated from separate media including serum or 
eggs, each isolate comprising a single species of immunoglobulin polypeptide. The method 
may further comprise the step of combining a plurality of isolated heterologous 
immunoglobulin polypeptides, thereby producing an antibody capable of selectively binding 
to an antigen. In this embodiment, two individual transgenic avians may be generated, 
wherein one transgenic avian produces serum or eggs having an immunoglobulin heavy chain 
variable region, or a polypeptide comprising such, expressed therein. A second transgenic 
animal, having a second transgene, produces serum or eggs having an immunoglobulin light 
chain variable region, or a polypeptide comprising such, expressed therein. The 
polypeptides may be isolated from their respective sera and eggs and combined in vitro to 
generate a binding site capable of binding an antigen. 

Examples of therapeutic antibodies that can be used in methods of the invention 
include but are not limited to HERCEPTIN® (Trastuzumab) (Genentech, CA) which is a 
humanized anti-HER2 monoclonal antibody for the treatment of patients with metastatic 
breast cancer; REOPRO® (abciximab) (Centocor) which is an anti-glycoprotein IIb/IIIa 
receptor on the platelets for the prevention of clot formation; ZENAPAX® (daclizumab) 
(Roche Pharmaceuticals, Switzerland) which is an immunosuppressive, humanized 
anti-CD25 monoclonal antibody for the prevention of acute renal allograft rejection; 
PANOREX™ which is a murine anti-17-1A cell surface antigen IgG2a antibody (Glaxo 
Wellcome/Centocor); BEC2 which is a murine anti-idiotype (GD3 epitope) IgG antibody 
(ImClone System); IMC-C225 which is a chimeric anti-EGFR IgG antibody (ImClone 
System); VITAXIN™ which is a humanized anti-αVβ3 integrin antibody (Applied 
Molecular Evolution/MedImmune); Campath 1H/LDP-03 which is a humanized anti-CD52 
IgG1 antibody (Leukosite); Smart M195 which is a humanized anti-CD33 IgG antibody 
(Protein Design Lab/Kanebo); RITUXAN™ which is a chimeric anti-CD20 IgG1 antibody 
(IDEC Pharm/Genentech, Roche/Zenyaku); LYMPHOCIDE™ which is a humanized 
anti-CD22 IgG antibody (Immunomedics); ICM3 is a humanized anti-ICAM3 antibody (ICOS 
Pharm); IDEC-114 is a primatized anti-CD80 antibody (IDEC Pharm/Mitsubishi); 
ZEVALIN™ is a radiolabelled murine anti-CD20 antibody (IDEC/Schering AG); IDEC-131 
is a humanized anti-CD40L antibody (IDEC/Eisai); IDEC-151 is a primatized anti-CD4 
antibody (IDEC); IDEC-152 is a primatized anti-CD23 antibody (IDEC/Seikagaku); 
SMART anti-CD3 is a humanized anti-CD3 IgG (Protein Design Lab); 5G1.1 is a 
humanized anti-complement factor 5 (C5) antibody (Alexion Pharm); D2E7 is a humanized 
anti-TNF-α antibody (CAT/BASF); CDP870 is a humanized anti-TNF-α Fab fragment 
(Celltech); IDEC-151 is a primatized anti-CD4 IgG1 antibody (IDEC Pharm/SmithKline 
Beecham); MDX-CD4 is a human anti-CD4 IgG antibody (Medarex/Eisai/Genmab); 
CDP571 is a humanized anti-TNF-α IgG4 antibody (Celltech); LDP-02 is a humanized 
anti-α4β7 antibody (LeukoSite/Genentech); OrthoClone OKT4A is a humanized anti-CD4 IgG 
antibody (Ortho Biotech); ANTOVA™ is a humanized anti-CD40L IgG antibody (Biogen); 
ANTEGREN™ is a humanized anti-VLA-4 IgG antibody (Elan); and CAT-152 is a human 
anti-TGF-β2 antibody (Cambridge Ab Tech). 

5.3.2 PROTEIN RECOVERY 

The protein of the present invention may be produced in purified form by any known 
conventional technique. For example, chicken cells may be homogenized and centrifuged. 
The supernatant can then be subjected to sequential ammonium sulfate precipitation and 
heat treatment. The fraction containing the protein of the present invention is subjected to 
gel filtration in an appropriately sized dextran or polyacrylamide column to separate the 
proteins. If necessary, the protein fraction may be further purified by HPLC. In another 
embodiment, an affinity column is used, wherein the protein is expressed with a tag. 

Accordingly, the invention provides proteins that are produced by transgenic avians 
of the invention. In a preferred embodiment, the protein is produced and isolated from an 
avian egg. In another embodiment, the protein is produced and isolated from avian serum. 

5.4 PHARMACEUTICAL COMPOSITIONS 

The present invention further provides pharmaceutical compositions, formulations, 
dosage units and methods of administration comprising the heterologous proteins produced 
by the transgenic avians using methods of the invention. Preferably, compositions of the 
invention comprise a prophylactically or therapeutically effective amount of the 
heterologous protein, and a pharmaceutically acceptable carrier. 

The term "carrier" refers to a diluent, adjuvant, excipient, or vehicle with which a 
compound of the invention is administered. Such pharmaceutical vehicles can be liquids, 
such as water and oils, including those of petroleum, animal, vegetable or synthetic origin, 
such as peanut oil, soybean oil, mineral oil, sesame oil and the like. The pharmaceutical 
vehicles can be saline, gum acacia, gelatin, starch paste, talc, keratin, colloidal silica, urea, 
and the like. In addition, auxiliary, stabilizing, thickening, lubricating and coloring agents 
may be used. When administered to a patient, the compounds of the invention and 
pharmaceutically acceptable vehicles are preferably sterile. Water is a preferred vehicle 
when the compound of the invention is administered intravenously. Saline solutions and 
aqueous dextrose and glycerol solutions can also be employed as liquid vehicles, 
particularly for injectable solutions. Suitable pharmaceutical vehicles also include 
excipients such as starch, glucose, lactose, sucrose, gelatin, malt, rice, flour, chalk, silica 
gel, sodium stearate, glycerol monostearate, talc, sodium chloride, dried skim milk, 
glycerol, propylene glycol, water, ethanol and the like. The present compositions, if desired, 
can also contain minor amounts of wetting or emulsifying agents, or pH buffering agents. 

The present compositions can take the form of solutions, suspensions, emulsions, 
tablets, pills, pellets, capsules, capsules containing liquids, powders, sustained-release 
formulations, suppositories, aerosols, sprays, or any other form 
suitable for use. In one embodiment, the pharmaceutically acceptable vehicle is a capsule 
(see, e.g., U.S. Patent No. 5,698,155). Other examples of suitable pharmaceutical vehicles 
are described in "Remington: the Science and Practice of Pharmacy", 20th ed., by Mack 
Publishing Co. 2000. 

In a preferred embodiment, the heterologous proteins are formulated in accordance 
with routine procedures as a pharmaceutical composition adapted for intravenous 
administration to human beings. Typically, compounds of the invention for intravenous 
administration are solutions in sterile isotonic aqueous buffer. Where necessary, the 
compositions may also include a solubilizing agent. Compositions for intravenous 
administration may optionally include a local anesthetic such as lignocaine to ease pain at 
the site of the injection. Generally, the ingredients are supplied either separately or mixed 
together in unit dosage form, for example, as a dry lyophilized powder or water-free 
concentrate in a hermetically sealed container such as an ampoule or sachette indicating the 
quantity of active agent. Where the heterologous protein of the invention is to be 
administered by infusion, it can be dispensed, for example, with an infusion bottle 
containing sterile pharmaceutical grade water or saline. Where the composition of the 
invention is administered by injection, an ampoule of sterile water for injection or saline can 
be provided so that the ingredients may be mixed prior to administration. 

Compositions for oral delivery may be in the form of tablets, lozenges, aqueous or 
oily suspensions, granules, powders, emulsions, capsules, syrups, or elixirs, for example. 
Orally administered compositions may contain one or more optional agents, for example, 
sweetening agents such as fructose, aspartame or saccharin; flavoring agents such as 
peppermint, oil of wintergreen, or cherry; coloring agents; and preserving agents, to provide 
a pharmaceutically palatable preparation. Moreover, where in tablet or pill form, the 
compositions may be coated to delay disintegration and absorption in the gastrointestinal 
tract, thereby providing a sustained action over an extended period of time. Selectively 
permeable membranes surrounding an osmotically active driving compound are also 
suitable for orally administered compounds of the invention. In these latter platforms, fluid 
from the environment surrounding the capsule is imbibed by the driving compound, which 
swells to displace the agent or agent composition through an aperture. These delivery 
platforms can provide an essentially zero order delivery profile as opposed to the spiked 
profiles of immediate release formulations. A time delay material such as glycerol 
monostearate or glycerol stearate may also be used. Oral compositions can include standard 
vehicles such as mannitol, lactose, starch, magnesium stearate, sodium saccharin, cellulose, 
magnesium carbonate, etc. Such vehicles are preferably of pharmaceutical grade. 

Further, the effect of the heterologous proteins may be delayed or prolonged by 
proper formulation. For example, a slowly soluble pellet of the compound may be prepared 
and incorporated in a tablet or capsule. The technique may be improved by making pellets 
of several different dissolution rates and filling capsules with a mixture of the pellets. 
Tablets or capsules may be coated with a film which resists dissolution for a predictable 
period of time. Even the parenteral preparations may be made long-acting, by dissolving or 
suspending the compound in oily or emulsified vehicles which allow it to disperse only 
slowly in the serum. 

5.5 TRANSGENIC AVIANS 

Another aspect of the present invention concerns transgenic avians, preferably 
chicken or quail, produced by methods of the invention described in section 5.1 supra, 
preferably by introducing a nucleic acid comprising a transgene into an avian oocyte by the 
sperm-mediated transfection methods of the present invention. In one embodiment, a 
heterologous nucleic acid is introduced into an avian oocyte by sperm-mediated transfection, 
resulting in a transgenic embryo which is then allowed to develop; preferably, the embryo is 
transferred into the reproductive tract of a recipient hen, where it is encapsulated by natural 
egg white proteins and a natural egg shell, and is then incubated and hatched to produce a 
transgenic chick. The heterologous polypeptide or polypeptides encoded by the transgenic 
heterologous nucleic acid may be secreted into the oviduct lumen of the mature transgenic 
chicken and deposited as a constituent component of egg white. The resulting transgenic 
avian chick (i.e., the G0) will carry one or more desired transgene(s) in some or all of its 
cells, preferably in its germ line. These G0 transgenic avians can be bred using methods well 
known in the art to generate second generation (i.e., G1s) transgenic avians that carry the 
transgene, i.e., achieve germline transmission of the transgene. In preferred embodiments, 
the methods of the invention result in germline transmission, i.e., a percentage of G0s that 
transmit the transgene to progeny (G1s), that is greater than 5%, preferably greater than 
10%, 20%, 30%, 40%, and, most preferably, greater than 50%, 60%, 70%, 80%, 90% or 
even 100%. In other embodiments, the efficiency of transgenesis (i.e., the number of G0s 
containing the transgene) is greater than 2%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 
80% or 99%. 
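
The two percentages defined above can be illustrated with a short Python sketch. It is only an illustration under stated assumptions: the record layout and field names are hypothetical, and both rates are computed against all G0 birds screened, which is one reading of the definitions given in the text.

```python
from dataclasses import dataclass


@dataclass
class G0Record:
    """Screening result for one hatched G0 bird (field names are hypothetical)."""
    bird_id: str
    carries_transgene: bool  # transgene detected in the G0 itself
    g1_screened: int         # number of G1 progeny tested
    g1_positive: int         # number of G1 progeny carrying the transgene


def transgenesis_efficiency(birds):
    """Percentage of G0 birds that contain the transgene."""
    if not birds:
        raise ValueError("no G0 birds supplied")
    positive = sum(1 for b in birds if b.carries_transgene)
    return 100.0 * positive / len(birds)


def germline_transmission(birds):
    """Percentage of G0 birds that passed the transgene to at least one G1.

    Assumption: the denominator is every G0 screened, not only the
    transgene-positive G0s.
    """
    if not birds:
        raise ValueError("no G0 birds supplied")
    transmitters = sum(1 for b in birds if b.g1_positive > 0)
    return 100.0 * transmitters / len(birds)


# Made-up example flock, for illustration only.
flock = [
    G0Record("A", True, 40, 3),
    G0Record("B", True, 35, 0),
    G0Record("C", False, 20, 0),
    G0Record("D", False, 25, 0),
]
```

With this made-up flock, two of four G0 birds carry the transgene and one of four transmits it, so the functions would report 50% transgenesis efficiency and 25% germline transmission.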

The egg can be harvested after laying and before hatching of a chick, or further 
incubated to generate a cloned chick, optionally genetically modified. The cloned chick 
may carry a transgene in all or most of its cells. After maturation, the transgenic avian may 
lay eggs that contain one or more desired heterologous protein(s). 

The cloned chick may also be a knock-in chick expressing an alternative phenotype 
or capable of laying eggs having a heterologous protein therein. The reconstructed egg 
may also be cultured to term using the ex ovo method described by Perry et al. (supra). 

Following maturation, the transgenic avian and/or transgenic progeny thereof may 
lay eggs containing one or more desired heterologous protein(s) expressed therein and that 
can be easily harvested therefrom. The G1 chicks, when sexually mature, can then be bred 
to produce progeny that are homozygous or heterozygous for the transgene. 
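
As a hedged illustration of the breeding step, the expected genotype proportions among progeny can be worked out with a simple Punnett-square calculation. The sketch below assumes a single transgene insertion at one autosomal locus with ordinary Mendelian segregation; the allele notation ("T" for a chromosome carrying the transgene, "-" for one without it) and the function name are hypothetical.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def cross(parent_a, parent_b):
    """Return expected genotype fractions for a single-locus cross.

    Each parent is a 2-character string of alleles, where "T" marks a
    chromosome carrying the transgene and "-" marks one without it.
    """
    # Enumerate every pairing of one allele from each parent.
    counts = Counter(
        "".join(sorted(pair)) for pair in product(parent_a, parent_b)
    )
    total = sum(counts.values())
    return {genotype: Fraction(n, total) for genotype, n in counts.items()}


# Two heterozygous G1 birds: 1/4 homozygous, 1/2 heterozygous, 1/4 non-transgenic.
g1_intercross = cross("T-", "T-")

# Heterozygous G1 crossed to a non-transgenic bird: 1/2 heterozygous, 1/2 non-transgenic.
g1_outcross = cross("T-", "--")
```

Under these assumptions, homozygous progeny ("TT") arise only when both parents carry the transgene. Multiple insertion sites or sex-linked insertions would change the ratios.
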

A transgenic avian of the invention may contain at least one transgene, at least two 
transgenes, at least 3 transgenes, at least 4 transgenes, at least 5 transgenes, and preferably, 
though optionally, may express the subject nucleic acid encoding a polypeptide in one or 
more cells in the animal, such as the oviduct cells of the chicken. In embodiments of the 
present invention, the expression of the transgene may be restricted to specific subsets of 
cells, tissues, or developmental stages utilizing, for example, cis-acting sequences that 
control expression in the desired pattern. Toward this end, it is contemplated that 
tissue-specific regulatory sequences, or tissue-specific promoters, and conditional regulatory 
sequences may be used to control expression of the transgene in certain spatial patterns. 
Moreover, temporal patterns of expression can be provided by, for example, conditional 
recombination systems or prokaryotic transcriptional regulatory sequences. The inclusion 
of a 5' MAR region, and optionally the 3' MAR on either end of the sequence, in the 
expression cassettes suitable for use in the methods of the present invention may allow the 
heterologous expression unit to escape the chromosomal positional effect (CPE) and 
therefore be expressed at a more uniform level in transgenic tissues that received the 
transgene by a route other than through germ line cells. 

The transgenes may, in certain embodiments, be expressed conditionally, e.g., the 
heterologous protein coding sequence is under the control of an inducible promoter, such as 
a prokaryotic promoter or operator that requires a prokaryotic inducer protein to be 
activated. Operators present in prokaryotic cells have been extensively characterized in vivo 
and in vitro and can be readily manipulated to place them in any position upstream from or 
within a gene by standard techniques. Such operators comprise promoter regions and 
regions that specifically bind proteins such as activators and repressors. One example is the 
operator region of the lexA gene of E. coli to which the LexA polypeptide binds. Other 
exemplary prokaryotic regulatory sequences and the corresponding trans-activating 
prokaryotic proteins are disclosed by Brent and Ptashne in U.S. Patent No. 4,833,080 (the 
contents of which are herein incorporated by reference in their entirety). Transgenic animals 
can be created which harbor the subject transgene under transcriptional control of a 
prokaryotic sequence or other activator sequence that is not appreciably activated by avian 
proteins. Breeding of this transgenic animal with another animal that is transgenic for the 
corresponding trans-activator can be used to activate expression of the transgene. 
Moreover, expression of the conditional transgenes can also be induced by gene therapy-like 
methods wherein a gene encoding the trans-activating protein, e.g., a recombinase or a 
prokaryotic protein, is delivered to the tissue and caused to be expressed, such as in a 
cell-type specific manner. 

Transactivators in these inducible or repressive transcriptional regulation systems 
are designed to interact specifically with sequences engineered into the transgene. Such 
systems include those regulated by tetracycline ("tet systems"), interferon, estrogen, 
ecdysone, Lac operator, progesterone antagonist RU486, and rapamycin (FK506), with tet 
systems being particularly preferred (see, e.g., Gingrich and Roder, 1998, Annu. Rev. 
Neurosci. 21: 377-405; incorporated herein by reference in its entirety). These drugs or 
hormones (or their analogs) act on modular transactivators composed of natural or mutant 
ligand-binding domains and intrinsic or extrinsic DNA binding and transcriptional 
activation domains. In certain embodiments, expression of the heterologous peptide can be 
regulated by varying the concentration of the drug or hormone in medium in vitro or in the 
diet of the transgenic animal in vivo. 

In a preferred embodiment, the control elements of the tetracycline-resistance operon 
of E. coli are used as an inducible or repressive transactivator or transcriptional regulation 
system ("tet system") for conditional expression of the transgene. A tetracycline-controlled 
transactivator can require either the presence or absence of the antibiotic tetracycline, or one 
of its derivatives, e.g., doxycycline (dox), for binding to the tet operator of the tet system, 
and thus for the activation of the tet system promoter (Ptet). 
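
The two modes of tetracycline control described above can be summarized as a small truth table. The Python sketch below is an idealized on/off model only: it assumes the conventional names tTA (a transactivator that binds the tet operator in the absence of dox) and rtTA (a reverse transactivator that binds in the presence of dox), and it ignores basal leakiness and dose response. The function name is hypothetical.

```python
def ptet_active(transactivator, dox_present):
    """Return True if the tet system promoter (Ptet) is expected to be on.

    transactivator: "tTA"  binds the tet operator only without dox
                    "rtTA" binds the tet operator only with dox
    dox_present:    whether doxycycline (or tetracycline) is supplied
    """
    if transactivator == "tTA":
        return not dox_present
    if transactivator == "rtTA":
        return dox_present
    raise ValueError(f"unknown transactivator: {transactivator!r}")


# Enumerate all four combinations of transactivator and dox status.
truth_table = {
    (ta, dox): ptet_active(ta, dox)
    for ta in ("tTA", "rtTA")
    for dox in (False, True)
}
```

In this simplified view, supplying dox in the diet would switch a tTA-driven transgene off and an rtTA-driven transgene on.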

In a specific embodiment, a tetracycline-repressed regulatable system (TrRS) is used 
(Agha-Mohammadi and Lotze, 2000, J. Clin. Invest. 105(9): 1177-83; Shockett et al., 1995, 
Proc. Natl. Acad. Sci. USA 92: 6522-26; and Gossen and Bujard, 1992, Proc. Natl. Acad. 
Sci. USA 89: 5547-51; incorporated herein by reference in their entireties). 

In another embodiment, a reverse tetracycline-controlled transactivator, e.g., rtTA2 
S-M2, is used. The rtTA2 S-M2 transactivator has reduced basal activity in the absence of 
doxycycline, increased stability in eukaryotic cells, and increased doxycycline sensitivity 
(Urlinger et al., 2000, Proc. Natl. Acad. Sci. USA 97(14): 7963-68; incorporated herein by 
reference in its entirety). In another embodiment, the tet-repressible system described by 
Wells et al. (1999, Transgenic Res. 8(5): 371-81; incorporated herein by reference in its 
entirety) is used. In one aspect of the embodiment, a single plasmid Tet-repressible system 
is used. In another embodiment, the GAL4-UAS system (Ornitz et al., 1991, Proc. Natl. 
Acad. Sci. USA 88:698-702; Rowitch et al., 1999, J. Neuroscience 19(20):8954-8965; 
Wang et al., 1999, Proc. Natl. Acad. Sci. USA 96:8483-8488; Lewandoski, 2001, Nature 
Reviews (Genetics) 2:743-755) or a GAL4-VP16 fusion protein system (Wang et al., 1999, 
Proc. Natl. Acad. Sci. USA 96:8483-8488) is used. 

In other embodiments, conditional expression of a transgene is regulated by using a 
recombinase system that is used to turn on or off the gene's expression by recombination in 
the appropriate region of the genome in which the potential drug target gene is inserted. 
The transgene is flanked by recombinase sites, e.g., FRT sites. Such a recombinase system 
can be used to turn on or off expression of a transgene (for review of temporal genetic 
switches and "tissue scissors" using recombinases, see Hennighausen & Furth, 1999, Nature 
Biotechnol. 17: 1062-63). Exclusive recombination in a selected cell type may be mediated 
by use of a site-specific recombinase such as Cre, FLP-wild type (wt), FLP-L or FLPe. 
Recombination may be effected by any art-known method, e.g., the method of Doetschman 
et al. (1987, Nature 330: 576-78; incorporated herein by reference in its entirety); the 
method of Thomas et al. (1986, Cell 44: 419-28; incorporated herein by reference in its 
entirety); the Cre-loxP recombination system (Sternberg and Hamilton, 1981, J. Mol. Biol. 
150: 467-86; Lakso et al., 1992, Proc. Natl. Acad. Sci. USA 89: 6232-36; which are both 
incorporated herein by reference in their entireties); the FLP recombinase system of 
Saccharomyces cerevisiae (O'Gorman et al., 1991, Science 251: 1351-55); the Cre-loxP-
tetracycline control switch (Gossen and Bujard, 1992, Proc. Natl. Acad. Sci. USA 89: 5547-
51, incorporated herein by reference in its entirety); and the ligand-regulated recombinase 
system (Kellendonk et al., 1999, J. Mol. Biol. 285: 175-82; incorporated herein by reference 
in its entirety). Preferably, the recombinase is highly active, e.g., the Cre-loxP or the FLPe 
system, and has enhanced thermostability (Rodriguez et al., 2000, Nature Genetics 25: 139-
40; incorporated herein by reference in its entirety). 

In a specific embodiment, the ligand-regulated recombinase system of Kellendonk et 
al. (1999, J. Mol. Biol. 285: 175-82; incorporated herein by reference in its entirety) can be 
used. In this system, the ligand-binding domain (LBD) of a receptor, e.g., the progesterone 
or estrogen receptor, is fused to the Cre recombinase to increase specificity of the 
recombinase. 

In the case of an avian, a heterologous polypeptide or polypeptides encoded by the 
transgenic nucleic acid may be secreted into the oviduct lumen of the mature animal and 
deposited as a constituent component of the egg white into eggs laid by the animal. It is 
also contemplated to be within the scope of the present invention for the heterologous 
polypeptides to be produced in the serum of a transgenic avian. 

A leaky promoter such as the CMV promoter may be operably linked to a transgene, 
resulting in expression of the transgene in all tissues of the transgenic avian and thus in 
production of, for example, immunoglobulin polypeptides in the serum. Alternatively, the 
transgene may be operably linked to an avian promoter that may express the transgene in a 
restricted range of tissues such as, for example, oviduct cells and macrophages so that the 
heterologous protein may be identified in the egg white or the serum of a transgenic avian. 

Transgenic avians produced by the sperm-mediated transfection methods of the present 
invention will have the ability to lay eggs that contain one or more desired heterologous 
protein(s) or variant thereof. 

One embodiment of the present invention, therefore, is a transgenic avian produced 
by the sperm-mediated transfection methods of the present invention and having a 
heterologous polynucleotide sequence comprising a nucleic acid insert encoding a 
heterologous polypeptide and operably linked to an avian lysozyme gene expression control 
region, the gene expression control region comprising at least one 5' matrix attachment 
region, an intrinsically curved DNA region, at least one transcription enhancer, a negative 
regulatory element, at least one hormone responsive element, at least one avian CR1 repeat 
element, and a proximal lysozyme promoter and signal peptide-encoding region. 

Another embodiment of the present invention provides a transgenic avian further 
comprising a transgene with a lysozyme 3' domain. 

Accordingly, the invention provides transgenic avians produced by methods of the 
invention as described infra. In preferred embodiments, the transgenic avian contains a 
transgene comprising a heterologous peptide coding sequence operably linked to a promoter 
and, in certain embodiments, other regulatory elements. In more preferred embodiments, 
the transgenic avians of the invention produce heterologous proteins, preferably in a 
tissue-specific manner, more preferably such that they are deposited in the serum and, most 
preferably, such that the heterologous protein is deposited into the egg, particularly in the 
egg white. In preferred embodiments, the transgenic avians produce eggs containing greater 
than 5 µg, 10 µg, 50 µg, 100 µg, 250 µg, 500 µg, or 750 µg, more preferably greater than 1 
mg, 2 mg, 5 mg, 10 mg, 20 mg, 50 mg, 100 mg, 200 mg, 500 mg, 700 mg, 1 gram, 2 grams, 
3 grams, 4 grams or 5 grams of the heterologous protein. In preferred embodiments, the 
transgenic avians produce an immunoglobulin molecule and deposit the immunoglobulin in 
the egg or serum of the avian, and preferably, the immunoglobulin isolated from the egg or 
serum specifically binds its cognate antigen. The antibody so produced may bind the 
antigen with the same, greater or lesser affinity than the antibody produced in a mammalian 
cell, such as a myeloma or CHO cell. 

In specific embodiments, the transgenic avians of the invention were not produced, 
or are not progeny of a transgenic ancestor produced, using a eukaryotic viral vector, more 
particularly, not a retroviral vector (although, in certain embodiments, the vector may 
contain sequences derived from a eukaryotic viral vector, such as promoters, origins of 
replication, etc.). The transgenic avians of the invention include G0 avians, founder 
transgenic avians, G1 transgenic avians, avians containing the transgene in the sperm or 
ova, avians mosaic for the transgene and avians containing copies of the transgene in most 
or all of the cells. Contemplated by the invention are transgenic avians in which the 
transgene is episomal. In more preferred embodiments, the transgenic avians have the 
transgene integrated into one or more chromosomes. Chromosomal integration can be 
detected using a variety of methods well known in the art, such as, but not limited to, 
Southern blotting, PCR, etc. 

6. EXAMPLES

The present invention is further illustrated by the following examples. Each
example is provided by way of explanation of the invention, and is not intended to be a
limitation of the invention. In fact, it will be apparent to those skilled in the art that various
modifications, combinations, additions, deletions and variations can be made in the present
invention without departing from the scope or spirit of the invention. For instance, features
illustrated or described as part of one embodiment can be used in another embodiment to
yield a still further embodiment. It is intended that the present invention covers such
modifications, combinations, additions, deletions and variations as come within the scope of
the appended claims and their equivalents.

All references cited herein are incorporated herein by reference in their entirety and
for all purposes to the same extent as if each individual publication, patent or patent
application was specifically and individually indicated to be incorporated by reference in its
entirety for all purposes. The citation of any publication is for its disclosure prior to the
filing date and should not be construed as an admission that the present invention is not
entitled to antedate such publication by virtue of prior invention.

6.1 Example 1: Vectors Having Sperm-Specific Reporter Genes
The specific activity of spermatogenesis-specific promoters, such as the protamine
promoter necessary for post-meiotic-specific transcription of this gene, may be used to
selectively mark those sperm cells that have inherited the transgene of interest after meiotic
segregation.

The construct contains two separate elements. In one example, the first element
comprises an oviduct-specific promoter, such as that associated with a gene encoding
ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin or ovomucin. The promoter
is operatively linked to, and therefore drives the expression of, a gene coding for a desired
heterologous protein of interest, such as, but not limited to, a therapeutic protein like
interferon, erythropoietin (EPO), or an immunoglobulin.

The second element, which can be located either upstream or downstream from the
first element, contains the protamine promoter, or any fragment thereof that is sufficient to
drive the expression of a marker gene encoding a vital and color marker, such as the Green
Fluorescent Protein (GFP). Those sperm cells that incorporate the transgene into their
genomic DNA are vitally labeled during the late stages of spermiogenesis by the expression
of the GFP protein. Given that the construct contains both the above first and the second
elements, positive sperm cells also contain the transgene of interest.

Large numbers of positive sperm cells expressing the GFP protein are isolated using
Fluorescent Activated Cell Sorting (FACS). Sperm cells selected on the basis of the
expression of the incorporated marker gene are then used to breed hens by artificial
insemination protocols. Suitable avian insemination protocols have been described by
Etches (1996) Reprod. in Poultry (CAB International, Wallingford, UK), incorporated
herein by reference in its entirety. In those cases where the number of positive sperm
obtained after FACS isolation is too low for the likelihood of successful artificial
insemination, the females may be fertilized by the intramagnal insemination method of
Engel (1991) Poult. Sci. 70:1965-1969 or Trefil (1996) Br. Poult. Sci. 37:661-664,
incorporated herein by reference in their entireties. Alternatively, small numbers of positive
sperm cells are isolated under a microscope using UV light and then microinjected into
unfertilized eggs via the Intracytoplasmic Sperm Injection (ICSI) protocols of Perry (1999),
incorporated herein by reference in its entirety.

6.2 Example 2: Lipofection Gene Transfer to Avian Oocytes
(a) Isolation of the ovum: Donor hens were inseminated using the protocol for
avian artificial insemination described by Etches (1996), incorporated herein by reference in
its entirety. Fertilized ova were collected from the magnum region of the oviduct of
euthanized birds 1.5-3 hours after oviposition. Alternatively, a hen whose oviduct is
fistulated allows the collection of eggs for enucleation as taught by Gilbert and Woodgush
(1963, J. Reprod. Fertility 5: 451-453) and Pancer et al. (1989, Br. Poult. Sci. 30: 953-7).
The thick albumen capsule surrounding the ovum was removed using spatulas and the ovum
was placed in a well 48 mm in diameter and 23 mm in height containing Perry's salt solution
(see Perry (1988), incorporated herein by reference in its entirety).

(b) Preparation of lipofection solutions: Two lipofection solutions were used. The
first solution comprised 50 µg/ml of LIPOFECTAMINE™ (Gibco) pre-incubated for 1 hour
with the restriction endonuclease Not I (500 Units Not I per ml of lipofection solution), and
designated herein as "Lipofectamine/Not I solution". The second lipofection solution was
composed of 50 µg/ml of LIPOFECTAMINE™ pre-incubated for 1 hour with 500 µg of
peGFP linearized with Not I per ml of lipofection solution, herein described as
"Lipofectamine/peGFP solution." Lipofectin-treated eggs were then incubated for 1 hour.

(c) Gene transfer to avian oocytes by lipofection: The isolated ovum was then
placed inside a glass conical chamber (Figure 1A) so that the blastodisc was located in the
center of a window that opens at the narrower end of the conical chamber. A 40 mm
diameter and 8 mm high glass dish was used at the bottom of the cone to close the system.
Perry's salt solution was added to the bottom of the dish to prevent drying of the lower half of
the ovum. The Perry's salt solution overlaying the blastodisc (accessed through the window
opening of the cone) was then replaced by, for example, 100 µl of a lipofection solution
described below. The eggs were incubated for 1 hour. Alternatively, egg incubation can be
done by adding the lipofection solutions to the well and inverting the position of the
incubation chamber (Figure 1B), or by using a cloning cylinder around the blastodisc
(Figure 1C).

(d) Transfer of the lipofected egg: In a preferred embodiment, the ovum is
surgically transferred into the oviduct of the recipient hen shortly after lipofection according
to a described surgical procedure (Tanaka, 1994, supra). The recipient hens are
anesthetized by wing vein injection with pentobarbital (0.7 ml of a 68 mg/ml solution) or
using gas anesthetics such as Isoflurane shortly after laying. During this window, the
infundibulum is receptive to receiving a donor ovum but has not yet ovulated. Feathers
are removed from the abdominal area, and the area is scrubbed with betadine and rinsed with
70% ethanol. The bird is placed in a supine position and a surgical drape is placed over the
bird exposing the surgical area. An incision is made beginning at the junction of the sternal
rib to the breastbone and running parallel to the breastbone. The length of the incision is
approximately 6 cm. After cutting through the smooth muscle layers and the peritoneum,
the infundibulum is located. The infundibulum is externalized and opened using gloved
hands. The donor ovum is gently placed in the open infundibulum. Gravity facilitates the
movement of the ovum through the infundibulum and into the anterior magnum. The
internalized ovum is placed into the body cavity and the incision closed using interlocking
stitches both for the smooth muscle layer and the skin. The recipient hen is returned to her
cage and allowed to recover with free access to both feed and water. The hens resume
normal activities after a post-operative recovery time of less than 45 minutes. Once
transferred, the embryo develops inside the recipient hen and travels through the oviduct
where it is encapsulated by natural egg white proteins and a natural eggshell. Eggs laid by
the recipient hens are collected the next day, set, and incubated in a Jamesway incubator.
The eggs hatch 21 days later.

6.3 Example 3: Maintenance of Plasmid Linearization in the REMI Procedure

A plasmid that is to be integrated into the genomic nucleic acid of a sperm is
linearized by cleavage with a selected restriction endonuclease. The linearized nucleic acid
is then dephosphorylated at the exposed 5' ends of the newly formed cohesive regions by
alkaline phosphatase treatment. Suitable protocols for the alkaline phosphatase
dephosphorylation of nucleic acids are disclosed, for example, by Sambrook et al. (supra),
incorporated herein by reference in its entirety.

While not wishing to be bound by any one theory, it is believed that
dephosphorylated cohesive ends of the nucleic acid may hybridize to recircularize the
cleaved plasmid. Dephosphorylation of the 5' termini, however, prevents a DNA ligase from
covalently rejoining a 5' terminus to the adjacent 3' terminus, thereby preventing a stable
circular plasmid molecule from reforming. The cohesive ends of the non-ligated
circularized plasmid may dissociate within a sperm cell to give a linearized nucleic acid that
may integrate into the sperm genomic DNA.

Alternatively, a circular plasmid having a heterologous nucleic acid that is to be
integrated into the genomic nucleic acid of a sperm is digested with at least two different
restriction endonucleases that generate a linearized plasmid having two non-cohesive ends,
and wherein the desired heterologous nucleic acid (the transgenic element) remains intact
between the new termini of the cleaved plasmid. The restriction endonucleases are selected
to give dissimilar cohesive ends that cannot hybridize together to recircularize the cleaved
plasmid. The linearized nucleic acid is then delivered to the sperm with both of the
restriction endonucleases used to cleave the plasmid. The restriction endonucleases may be
delivered to the sperm sequentially or simultaneously and combined, or sequentially
delivered, with the cleaved plasmid.

It can be advantageous, depending upon the positions of the endonuclease cleavage
sites within the plasmid relative to the desired transgene, to use two different endonucleases
that produce hybridizable cohesive ends. In this case, the 5' termini may also be
dephosphorylated with alkaline phosphatase as described above, to prevent religation and
stabilization of the cleaved plasmid.
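
The distinction drawn above between dissimilar cohesive ends and hybridizable cohesive ends can be illustrated with a minimal sketch. It assumes that both overhangs are written 5' to 3' and that two single-stranded overhangs can anneal only when one is the exact reverse complement of the other. The overhang sequences used in the usage note (AATT and GATC) are illustrative assumptions and are not enzymes named in this example.

```python
# Minimal sketch: decide whether two cohesive (sticky) ends could hybridize.
# Assumption: overhangs are given 5'->3' and must pair along their full length.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def ends_can_anneal(overhang_a: str, overhang_b: str) -> bool:
    """True if the two overhangs are reverse complements of each other."""
    return overhang_a.upper() == reverse_complement(overhang_b)


if __name__ == "__main__":
    # Hypothetical overhangs, chosen only for illustration.
    print(ends_can_anneal("AATT", "AATT"))  # identical palindromic ends
    print(ends_can_anneal("AATT", "GATC"))  # dissimilar ends
```

In this sketch a plasmid cut to leave AATT at both termini could recircularize by hybridization, so dephosphorylation would be the safeguard, whereas a plasmid with AATT at one terminus and GATC at the other could not.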

6.4 Example 4: Methods for Determining the SV40 Ori Requirement in SMT

To determine the requirement for the SV40 origin of replication in sperm-mediated
transgenesis, 5 µg each of the plasmids p1083 (with the CMV promoter controlling heavy
chain transcription) and p1086 (where the CMV promoter controls light chain transcription)
were digested with Dra III, which excises the SV40 origin of replication from the p1083
plasmid while retaining the SV40 origin of replication of the p1086 plasmid. For
comparison, 5 µg each of the plasmids p1083 and p1086 were digested with the restriction
endonuclease Mlu I that linearizes both plasmids while retaining the SV40 origin of
replication in each of the respective plasmids.

Digested plasmids were used to transfect sperm. In a polystyrene tube, Dra III-
digested plasmids p1086 and p1083 (5 µg of each) were added to 100 µl of OPTIMEM™
medium (Life Technologies, Gaithersburg, MD) and 10 µg of LIPOFECTAMINE™
liposome (Life Technologies, Gaithersburg, MD). In a separate tube, 100 units of Dra III
restriction enzyme were added to 100 µl of OPTIMEM™ medium followed by 10 µg of
LIPOFECTAMINE™. The tubes were incubated at room temperature for 30 minutes, then
added to freshly collected semen containing 10^9 chicken sperm (approximately 300 µl of
semen). The sperm, DNA-liposome, and restriction enzyme-liposome mixture was
incubated at room temperature for 30 minutes.

Two White Leghorn hens were then artificially inseminated with 250 µl each of the
transfection mixture. Eggs were collected for 7 days starting on the second day after
fertilization, and set for hatch. Two weeks after hatch, serum samples were collected and
assayed for human monoclonal antibodies by ELISA. The results are shown in Figure 10,
wherein wing band number 3932 is the control.

6.5 Example 5: Gamma Irradiation of Chicken Sperm

Exogenous, linearized DNA can be integrated into the genome of a recipient sperm 
cell by cleaving the double-stranded genomic DNA by gamma irradiation of the sperm prior 
to lipofection thereof with the transgenic nucleic acid. 

Wooster et al. found that rooster sperm irradiated with 12 Grays (Gy) of gamma
irradiation resulted in about 43% residual fertility (1977, Can. J. Genet. Cytol. 19, 437-
446). Therefore, rooster semen will be irradiated with the following doses of gamma
radiation: 0, 1, 5, 10, 15, and 20 Gy. A liposomal complex consisting of 10 µg of
linearized DNA containing a promoter (e.g., CMV, ovalbumin, lysozyme, ovomucoid,
ovotransferrin, conalbumin, and ovomucin, etc.) and transgene (e.g., IFN, erythropoietin,
human monoclonal antibody immunoglobulin heavy and light chains, and GM-CSF, etc.)
and 10 µg of LIPOFECTAMINE™ (Life Technologies, Gaithersburg, MD) will then be
transfected into the irradiated sperm. After one hour, the irradiated and transfected sperm
will be introduced into the hen by traditional artificial insemination procedures. Resulting
laid eggs will be set and hatched, and transgene integration will be confirmed by Southern
analysis of blood DNA.

6.6 Example 6: Ovum Transfer to a Laying Hen

At the time of laying, recipient hens are anesthetized by wing vein injection with
pentobarbital (0.7 ml of a 68 mg/ml solution) or by a gaseous anesthetic such as Isoflurane.
Pentobarbital is the preferred anesthetic. At this time, the infundibulum is receptive to
receiving a donor ovum but has not yet ovulated. Feathers are removed from the abdominal
area, and the area is scrubbed with betadine and rinsed with 70% ethanol. The bird is
placed in a supine position and a surgical drape is placed over the bird with the surgical area
exposed. An incision is made beginning at the junction of the sternal rib to the breastbone
and running parallel to the breastbone. The length of the incision is approximately two
inches. After cutting through the smooth muscle layers and the peritoneum, the
infundibulum is located. The infundibulum is externalized and opened using gloved hands
and the donor ovum is gently applied to the open infundibulum. The ovum is allowed to
move into the infundibulum and into the anterior magnum by gravity feed. The internalized
ovum is placed into the body cavity and the incision closed using interlocking stitches both
for the smooth muscle layer and the skin. The recipient hen is returned to her cage and
allowed to recover with free access to both feed and water. Recovery time for the bird to be
up, moving and feeding is usually within 45 minutes of the operation's end. Eggs laid by
the recipient hens are collected the next day, set, and incubated. They will hatch 21 days
later.

6.7 Example 7: Generation of Transgenic Chickens by Sperm-Mediated
Transfection of Heterologous Nucleic Acid

Plasmid pRC/CMV-EGFP, 10 µg, was added to 100 µl of OPTIMEM™ medium
(Life Technologies, Gaithersburg, MD) and 10 µg of LIPOFECTAMINE™ (Life
Technologies, Gaithersburg, MD) liposomes, in a polystyrene tube. In a separate tube, 100
units of Dra III restriction enzyme were added to 100 µl of OPTIMEM™ medium followed
by 10 µg of LIPOFECTAMINE™. As negative controls, plasmids p1086 and p1083 were
used in place of pRC/CMV-EGFP in the transfection mixture. Tubes were incubated at room
temperature for 30 minutes, then added to 10^9 freshly collected chicken sperm
(approximately 300 µl of sperm). The sperm, DNA-liposome, and restriction enzyme-
liposome mixture was incubated at room temperature for 30 minutes.

Two White Leghorn hens were inseminated with the transfection mixture, each hen
receiving approximately 250 µl of the transfection mixture. Eggs were collected for 7 days
starting on the second day after fertilization, and set for hatch.
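
The quantities given above imply an approximate sperm dose per hen, which the following sketch works out. It assumes that the two 100 µl reagent tubes and the roughly 300 µl of semen are simply pooled, that the small volumes of plasmid, enzyme and liposome stock are negligible, and that the sperm are evenly suspended. The function name is hypothetical.

```python
# Rough dose arithmetic for the transfection mixture described in this example.
# Assumptions: volumes are additive, reagent stock volumes are negligible,
# and sperm are uniformly distributed in the pooled mixture.

def sperm_per_hen(total_sperm: float,
                  semen_ul: float,
                  reagent_tube_volumes_ul: list,
                  dose_ul: float) -> float:
    """Estimate the number of sperm delivered in one insemination dose."""
    total_volume_ul = semen_ul + sum(reagent_tube_volumes_ul)
    return total_sperm / total_volume_ul * dose_ul


if __name__ == "__main__":
    dose = sperm_per_hen(
        total_sperm=1e9,                       # freshly collected chicken sperm
        semen_ul=300.0,                        # approximate semen volume
        reagent_tube_volumes_ul=[100.0, 100.0],  # DNA tube and enzyme tube
        dose_ul=250.0,                         # volume given to each hen
    )
    print(f"{dose:.2e} sperm per hen")
```

Under these assumptions the pooled mixture is about 500 µl, so two 250 µl doses use it up and each hen receives roughly half of the sperm.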

Four days after hatching, blood drops from chicks were collected from leg veins
with heparinized capillary tubes and placed on microscope slides. Blood smears were
viewed with FITC illumination with an inverted microscope (Olympus IX70, 100 watt
mercury lamp, HQ-FITC Band Pass Emission filter cube, excitation 480/40 nm, emission
535/50 nm, and 20X phase contrast objective). Auto-fluorescence was assessed using a
TRITC filter (Olympus Modular B-MAX Filter cube, excitation 535/50 nm, emission
610/75 nm).

Two chicks that resulted from sperm transfected with pRC/CMV-EGFP had white
blood cells showing green fluorescence. No fluorescence was seen when viewed with the
TRITC filter, indicating that the green fluorescence was not due to auto-fluorescence. None
of the control chicks, derived from sperm transfected with control plasmids, had green
fluorescence in their blood.


6.8 Example 8: Sperm-Mediated Transfection of Japanese Quail Ova 
Prophetic Example 

Japanese Quail hens will be artificially inseminated with sperm transfected with
vectors capable of expressing α-IFN, erythropoietin or a monoclonal antibody. ELISAs will
be used to detect and measure the amount of an expressed transgene product in the animal's
serum and egg. As little as 15 pg of α-interferon (α-IFN) or erythropoietin can be detected
by this procedure.

To prepare the quail flock for artificial insemination, the females will be separated
from the males. Once the isolated females are no longer laying fertile eggs and the males
are consistently producing sufficient semen, the birds will be used for artificial insemination
(A.I.) procedures.

Sperm-mediated transgenesis (SMT) of the quail will be performed with two
plasmid vectors, pRC/CMV-IFNMM-SV40 and pRC/CMV-EPOMM-SV40. Transgenesis
resulting in the integration of, and expression from, a heterologous nucleic acid encoding
α-IFN has been used successfully in chickens with both viral-based and sperm-mediated
transfer (SMT)-based systems. The second vector will carry the gene encoding
erythropoietin. This protein requires more extensive post-translational modification, i.e.,
four glycosylations, than does α-IFN. Both of the plasmid vectors will produce their
respective expressed polypeptides in serum and in ovo. Assaying for α-IFN or EPO
production in serum will begin at two weeks of age and egg production will occur shortly
thereafter. SMT will be performed with vectors having immunoglobulin heavy and light
chain under the expression control of a lysozyme promoter.

About 50 chicks will be obtained from the SMT-A.I.'s. Based on results from our
chicken SMT experiments, at least 2 to 4 transgenic quail for every 50 birds will be
produced from the SMT-A.I.'s.

6.9 Example 9: Preparation of Female and Male Japanese Quails for
Sperm-Mediated Transfection by Artificial Insemination

The birds used will be selected for their optimal age for fertility, according to the
average life history of the quail as shown in Table 1.



Table 1: Japanese Quail life history

Stage                              Age
Hatching                           16-17 days
Sexual maturity
  Females
    Under current conditions       48 days
    Under optimal conditions       35-38 days
  Males                            35-42 days
Optimal fertility
  Females                          60-240 days (8-34 weeks)
  Males                            60-280 days (8-40 weeks)
Declining fertility
  Fertility declines 30-50%        224 days (32 weeks)
Gamete production cessation
  Females (eggs)                   1.5-2 years
  Males
    Decreases                      1.5 years
    Ceases                         3 years
Lifespan
  Females                          2.5-3 years
  Males                            3-5 years
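
Selecting birds by age against Table 1 can be expressed as a simple range check. The sketch below is illustrative only: it encodes just the optimal-fertility rows of Table 1, treats both ends of each range as inclusive (an assumption), and uses hypothetical names throughout.

```python
# Minimal sketch: is a quail of a given sex and age inside the
# optimal-fertility window listed in Table 1?
# Assumption: window boundaries are inclusive.

OPTIMAL_FERTILITY_DAYS = {
    "female": (60, 240),  # 8-34 weeks
    "male": (60, 280),    # 8-40 weeks
}


def in_optimal_fertility_window(sex: str, age_days: int) -> bool:
    """Return True if the bird's age falls within the Table 1 window."""
    low, high = OPTIMAL_FERTILITY_DAYS[sex.lower()]
    return low <= age_days <= high


if __name__ == "__main__":
    for sex, age in [("female", 100), ("female", 250), ("male", 250)]:
        print(sex, age, in_optimal_fertility_window(sex, age))
```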



Females: Female Japanese Quail will be separated from males and eggs inspected for
fertilization over a 10 day period. Females will begin producing infertile eggs about 7 to 10
days after removal from the males.

Males: Male birds will be separated from females and conditioned for semen
collection. The conditioning will be continued for 10 days. About 60% of males at their
sexual peak will produce good semen, and consistently high volume semen producing birds
will be progressively selected.

(a) Semen Quality and Lipofection Optimization: The extracted quail sperm will be
added to a diluent that is at a higher pH than is typically used with chicken sperm. Quail
sperm, compared to chicken sperm, require a higher pH to maintain motility once collected
from the animal, as reported by Holm, L. & Wishart, G.J. in Animal Reprod. Sci. 54: 45-54
(1998) and incorporated herein by reference in its entirety. A semen diluent having a pH of
between about 8 and about 9 maintains motility better than does a pH of 7.

Artificial insemination (A.I.): Each hen will be artificially inseminated with a 25 µl dose
containing 2.5 x 10^7 sperm per hen. Hens will be divided into Group 1: 4 females
inseminated with semen only; Group 2: 4 females inseminated with semen treated with
LIPOFECTAMINE™; Group 3: 4 females sperm-mediated transfected with pCMV-IFN-
SV40; Group 4: 4 females sperm-mediated transfected with pCMV-EPO-SV40.
Since the average fertility of hens after artificial insemination lasts about 4 days, the hens will
be inseminated twice a week to ensure delivery of a fresh supply of transfected semen.
Transgenic positive birds will be mated to produce G1 chicks. The first three eggs of each
bird will be screened for IFN, EPO or immunoglobulin polypeptide expression and the
remaining eggs will be incubated to hatching.
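
The dose and group sizes stated above fix the sperm concentration of each dose and the weekly amount of diluted semen required. The sketch below works through that arithmetic; the group labels are shortened from the text, and the assumption that every hen in every group receives the same 25 µl dose twice a week is taken from the paragraph above. Function names are hypothetical.

```python
# Arithmetic sketch for the quail insemination plan described above.
# Assumption: all four groups receive identical 25 µl doses twice a week.

DOSE_VOLUME_UL = 25.0
SPERM_PER_DOSE = 2.5e7

GROUPS = {
    "semen only": 4,
    "LIPOFECTAMINE-treated semen": 4,
    "pCMV-IFN-SV40": 4,
    "pCMV-EPO-SV40": 4,
}


def sperm_per_ml(sperm_per_dose: float, dose_volume_ul: float) -> float:
    """Concentration implied by a dose, in sperm per ml."""
    return sperm_per_dose / dose_volume_ul * 1000.0


def weekly_requirement(groups: dict, inseminations_per_week: int = 2):
    """Return (doses, total volume in µl, total sperm) needed per week."""
    doses = sum(groups.values()) * inseminations_per_week
    return doses, doses * DOSE_VOLUME_UL, doses * SPERM_PER_DOSE


if __name__ == "__main__":
    print(f"{sperm_per_ml(SPERM_PER_DOSE, DOSE_VOLUME_UL):.1e} sperm/ml")
    doses, volume_ul, sperm = weekly_requirement(GROUPS)
    print(doses, "doses,", volume_ul, "µl,", f"{sperm:.1e} sperm per week")
```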

(b) Hatchling care: Newly hatched chicks will be grown at 105°F-110°F for the
first 4-5 days. The temperature will then be reduced by 5°F after the first and second week
in 2°F increments. By the third week the house temperature will be sufficient. A 16/8
lighting schedule will also be used.

6.10 Example 10: Quail Semen Collection 

The male bird is grasped so its breastbone rests in the palm of the right hand. The
tail is positioned so the first two fingers of the right hand lie on either side of the vent just
below the legs. Holding the male in an almost vertical position, the left hand gently
squeezes four times at the base of the cloaca to remove the foamy secretions of the glandula
proctodealis (foam gland). The vent is wiped to remove traces of the foamy substance and
to prevent contamination of the semen. The left hand maintains firm pressure against the
base of the cloaca and gently pulls back on the dorsal proctodeal wall to achieve erection.

The first two fingers of the right hand gently massage the abdomen and apply
moderate pressure just below the vent to force semen from the vas deferens into the
copulatory organ. The semen will appear shortly thereafter. The viscous, pale yellow to
white semen is collected with a 20 µl pipette and immediately diluted with 150 mM NaCl
and 20 mM N-tris[Hydroxymethyl]methyl-2-aminoethane-sulfonic acid (TES), at pH 8.0.

6.11 Example 11: Lipofection of Quail Sperm 

Quail semen will be diluted, immediately after harvesting, to a concentration of 10^8
sperm/ml in 150 mM NaCl and 20 mM N-tris[Hydroxymethyl]methyl-2-aminoethane-
sulfonic acid (TES), pH 8.0 buffer. Semen extender that is optimized for chicken sperm
may not be used since it rapidly immobilizes quail semen within five minutes of contact.

The lipofection procedures used with quail sperm will be similar to those adopted
for chicken lipofection, including REMI sperm-mediated transfections (SMT). With the
chicken SMT procedure, artificial insemination is with approximately 6 x 10^8 sperm. Due
to the limited amount of semen produced by male quail, 1 x 10^8 quail sperm will be used per
hen. The lowest number of sperm that still gives maximum insemination will be
adopted. Typically, the DNA (1.0 µg), restriction enzyme, LIPOFECTAMINE™ (1.0 µg)
and sperm (10^8) will be incubated together at a ratio of / respectively for 30 minutes. All
reactions will be carried out in OptiMEM™ medium (Gibco-BRL, Gaithersburg, MD).
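
Diluting harvested semen to the stated 10^8 sperm/ml is a standard C1 x V1 = C2 x V2 calculation. The sketch below shows it under the assumption that the concentration of the raw semen has been measured beforehand; the raw concentration used in the usage line (5 x 10^8 sperm/ml) is a hypothetical value for illustration, not a figure from this example.

```python
# Dilution sketch: how much TES/NaCl buffer to add so that harvested semen
# reaches the target concentration of 1e8 sperm/ml.
# Assumption: the raw concentration is known; the value below is hypothetical.

def diluent_volume_ul(semen_volume_ul: float,
                      raw_conc_per_ml: float,
                      target_conc_per_ml: float = 1e8) -> float:
    """Volume of buffer (µl) to add, from C1 * V1 = C2 * V2."""
    if raw_conc_per_ml < target_conc_per_ml:
        raise ValueError("semen is already more dilute than the target")
    final_volume_ul = semen_volume_ul * raw_conc_per_ml / target_conc_per_ml
    return final_volume_ul - semen_volume_ul


if __name__ == "__main__":
    # 20 µl collected, hypothetical raw concentration of 5e8 sperm/ml.
    print(diluent_volume_ul(20.0, 5e8), "µl of buffer")
```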

6.12 Example 12: Integration of Adeno-Associated Virus (AAV) Inverted Terminal
Repeats-Flanked Genes Introduced by Sperm-Mediated Transgenesis

The chromosomal integration of plasmid DNA into the genome of an avian cell will
be mediated by flanking the gene of interest and sequences related to its expression, with
AAV inverted terminal repeat (ITR) sequences. A method for gene delivery and integration
of heterologous nucleic acid sequences into the genomic DNA of a mammalian cell is
described by Solis et al. in U.S. Patent Serial No. 5,843,742 incorporated herein by
reference in its entirety. A nucleic acid segment will also be included with the gene of
interest that will result in the expression of the AAV Rep protein within the same cell.

For example, a plasmid nucleic acid vector containing an expression cassette
consisting of a CMV immediate early promoter driving the expression of human
erythropoietin will be flanked by AAV ITR sequences. This plasmid will be introduced by
sperm-mediated transgenesis into targeted host cells together with a second nucleic acid
vector plasmid. This second plasmid will include an expression cassette comprising the
CMV immediate early promoter driving expression of the nucleic acid sequence encoding
the AAV Rep 78 protein. Alternatively, a single nucleic acid vector comprising the
expression cassette comprising the CMV immediate early promoter driving expression of
the nucleic acid sequence encoding the AAV Rep 78 protein and the cassette expressing the
gene of interest, such as erythropoietin, will be introduced together into an avian male germ
cell.

6.13 Example 13: DNA Construct Modification to Improve Germline
Transmission of Transgenes

Following genetic modification in vertebrates, a low percentage of offspring
derived from the founder animals are transgenic given the low number of germline cells that
carry the transgene. As a result, costly and cumbersome breeding of the founder animals is
required to expand the number of transgenic animals derived from the original founder
animals.

A number of articles (e.g., Peschon, 1989, Ann. NY Acad. Sci. 564: 186-197;
Peschon, 1987, PNAS 84: 5316-5319; Zambrowicz, 1993, PNAS 90: 5071; Braun, 1989,
Gene Dev. 3:793-802; Rhim, 1995, Biol. Reprod. 52:20-32) as well as patent applications
(O'Gorman et al., PCT Publication No. WO 99/10488) have identified and used the
elements of the protamine promoter necessary for post-meiotic-specific transcription of this
gene. Other spermiogenesis-specific promoters have also been described and used in the
context of genetic manipulation (Sage, 1999, Mech. Dev. 80: 29-39; Vidal, 1998, Mol.
Reprod. Dev. 51: 274-280). In this example, we take advantage of the specific activity of
these promoters to selectively mark those sperm cells that have inherited the transgene of
interest after meiotic segregation.

In the example described here, the construct would contain two independent
elements. In a preferred example, the first element would comprise an oviduct-specific
promoter, such as that of ovalbumin, lysozyme, ovomucoid, ovotransferrin, conalbumin, or
ovomucin. The promoter would drive expression of a gene coding for a protein of interest,
such as a therapeutic protein like interferon or erythropoietin (EPO). Alternatively,
constitutive promoters such as CMV or RSV may also be used.

The second element, located up or downstream from the first, would contain the
protamine promoter, or a segment of this promoter that is sufficient to drive the expression
of a marker gene. In a preferred example, the protamine promoter would drive the
expression of a marker, preferably a vital and color marker, such as the Green Fluorescent
Protein (GFP). In such example, those sperm cells that have inherited the transgene would
be vitally labeled during the late stages of spermiogenesis with the expression of the GFP
protein. Given that the construct used contains both the first and the second elements
described above, positive sperm cells would also contain the transgene of interest.

Large numbers of positive sperm cells expressing the GFP proteins could be isolated
using Fluorescent Activated Cell Sorting (FACS). These sperm cells could subsequently be
used to breed hens by described artificial insemination protocols (Etches, 1996, Mol.
Reprod. Dev. 45:2918). In cases where the number of positive sperm after FACS isolation
is low and insufficient for AI, the females could be bred through intramagnal insemination
(Engel, 1991, Poultry Sci. 70: 1965; Trefil, 1996, Br. Poult. Sci. 37: 661-664).
Alternatively, small numbers of positive sperm cells could be isolated under a microscope
using UV light and injected into unfertilized eggs via described Intracytoplasmic Sperm
Injection (ICSI) protocols (Perry, 1999, Science 284: 1180-83).

6.14 Example 14: Use of Chicken Centromeric and Telomeric Sequences to
Create a Chicken Artificial Chromosome (ChAC)

The Shemesh et al. procedure (2000, Molecular Reproduction and Development 56:
306-308) for introducing linearized plasmid DNA into chicken sperm appears to rely on
vector sequences which include an SV40 origin of replication. It is possible that the
exogenous DNA therefore replicates as an episome and would most likely be lost in
subsequent cell divisions due to improper segregation at mitosis. To ensure proper
segregation at mitosis, chicken centromere and telomere sequences could be included in the
transgenic construct. Chicken centromere and telomere sequences could be obtained on a
BAC (bacterial artificial chromosome) library clone from Texas A&M University or Martin
Groenen at Wageningen Agricultural University, The Netherlands. The SV40 origin of
replication and the promoter (i.e., CMV, ovalbumin, lysozyme, ovomucoid, ovotransferrin,
conalbumin, and ovomucin, etc.) and transgene (i.e., IFN, EPO, human monoclonal antibody
heavy and light chains, GM-CSF, etc.) combination could be cloned into the BAC clone
containing the chicken centromere. This BAC would therefore contain an origin of
replication, a centromere, telomere, and the promoter/transgene combination which could be
transfected into sperm with the Shemesh procedure. Due to the chicken centromere and
telomeres, the construct would replicate and segregate as a chicken artificial chromosome
(ChAC).

6.15 Example 15: Construction of Lysozyme Promoter Plasmids

The chicken lysozyme gene expression control region was isolated by PCR
amplification. Ligation and reamplification of the fragments thereby obtained yielded a
contiguous nucleic acid construct comprising the chicken lysozyme gene expression control
region operably linked to a nucleic acid sequence optimized for codon usage in the chicken
(SEQ ID NO: 5) and encoding a human interferon α2b polypeptide optimized for expression
in an avian cell.

White Leghorn chicken (Gallus gallus) genomic DNA was PCR amplified using the
primers 5pLMAR2 (SEQ ID NO: 1) and LE-6.1kbrev1 (SEQ ID NO: 2) in a first reaction,
and Lys-6.1 (SEQ ID NO: 3) and LysE1rev (SEQ ID NO: 4) as primers in a second reaction.
PCR cycling steps were: denaturation at 94°C for 1 minute; annealing at 60°C for 1 minute;
extension at 72°C for 6 minutes, for 30 cycles using TAQ PLUS PRECISION DNA
polymerase (STRATAGENE®, La Jolla, CA). The PCR products from these two reactions
were gel purified, and then united in a third PCR reaction using only 5pLMAR2 (SEQ ID
NO: 1) and LysE1rev (SEQ ID NO: 4) as primers and a 10-minute extension period. The
resulting DNA product was phosphorylated, gel-purified, and cloned into the EcoR V
restriction site of the vector pBLUESCRIPT® KS, resulting in the plasmid p12.0-lys.

p12.0-lys was used as a template in a PCR reaction with primers 5pLMAR2 (SEQ
ID NO: 1) and LYSBSU
(5'-CCCCCCCCTAAGGCAGCCAGGGGCAGGAAGCAAA-3') (SEQ ID NO: 5) and a
10-minute extension time. The resulting DNA was phosphorylated, gel-purified, and cloned
into the EcoR V restriction site of pBLUESCRIPT® KS, forming plasmid p12.0lys-B.

p12.0lys-B was restriction digested with Not I and Bsu36 I, gel-purified, and cloned
into Not I and Bsu36 I digested pCMV-LysSPIFNMM, resulting in p12.0-lys-LSPIFNMM.

p12.0-lys-LSPIFNMM was digested with Sal I and the SalItoNotI primer
(5'-TCGAGCGGCCGC-3') (SEQ ID NO: 13) was annealed to the digested plasmid, followed
by Not I digestion. The resulting 12.5 kb Not I fragment, comprising the lysozyme promoter
region linked to the IFNMAGMAX-encoding region and an SV40 polyadenylation signal
sequence, was gel-purified and ligated to Not I cleaved and dephosphorylated
pBLUESCRIPT® KS, thereby forming the plasmid pAVIJCR-A115.93.1.2, which was then
sequenced.
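
The role of the two short oligonucleotides in this example can be checked in silico. The following is a minimal sketch, not part of the disclosed procedure: it searches the LYSBSU and SalItoNotI sequences, as printed above, for the Bsu36 I recognition sequence (CCTNAGG) and the Not I recognition sequence (GCGGCCGC). The function name and the dictionary of sites are illustrative assumptions. Note also that the SalItoNotI oligonucleotide begins with TCGA, which matches the 5' overhang left by Sal I.

```python
import re

# Recognition sequences written as regular expressions.
# Bsu36 I recognizes CCTNAGG (N = any base); Not I recognizes GCGGCCGC.
SITES = {
    "Bsu36I": "CCT[ACGT]AGG",
    "NotI": "GCGGCCGC",
}

def find_sites(seq, enzyme):
    """Return 0-based start positions of every occurrence of the enzyme's site."""
    # A lookahead is used so that overlapping occurrences would also be reported.
    pattern = re.compile("(?=(" + SITES[enzyme] + "))")
    return [m.start() for m in pattern.finditer(seq.upper())]

# Sequences copied from the text of this example.
LYSBSU = "CCCCCCCCTAAGGCAGCCAGGGGCAGGAAGCAAA"
SAL_TO_NOT = "TCGAGCGGCCGC"

print("Bsu36 I sites in LYSBSU:", find_sites(LYSBSU, "Bsu36I"))
print("Not I sites in SalItoNotI:", find_sites(SAL_TO_NOT, "NotI"))
```

Under these assumptions the LYSBSU primer carries a single Bsu36 I site near its 5' end, consistent with the later Not I/Bsu36 I digestion of p12.0lys-B, and the SalItoNotI oligonucleotide carries a Not I site immediately after the Sal I-compatible TCGA.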

6.16 Example 16: Construction of Plasmids Which Contain the 3' Lysozyme 
Domain 

The plasmid pAVIJCR-A115.93.1.2 (containing the -12.0 kb lysozyme promoter
controlling expression of human interferon α2b) was purified with a QIAGEN® Plasmid
Maxi Kit (QIAGEN®, Valencia, CA), and 100 µg of the plasmid were restriction digested
with NotI restriction enzyme. The digested DNA was phenol/CHCl3 extracted and ethanol
precipitated. Recovered DNA was resuspended in 1 mM Tris-HCl (pH 8.0) and 0.1 mM
EDTA, then placed overnight at 4°C. DNA was quantified by spectrophotometry and
diluted to the appropriate concentration. The DNA samples were bound to the SV40 T
antigen NLS peptide by incubation for 15 minutes.

The plasmid pAVIJCR-A115.93.1.2 was restriction digested with FseI and
blunt-ended with T4 DNA polymerase. The linearized, blunt-ended pAVIJCR-A115.93.1.2
plasmid was then digested with XhoI restriction enzyme, followed by treatment with
alkaline phosphatase. The resulting 15.4 kb DNA band containing the lysozyme 5' matrix
attachment region (MAR) and -12.0 kb lysozyme promoter driving expression of a human
interferon was gel purified by electroelution.

The plasmid pIIIilys was restriction digested with MluI, then blunt-ended with the
Klenow fragment of DNA polymerase. The linearized, blunt-ended pIIIilys plasmid was
digested with XhoI restriction enzyme and the resulting 6 kb band containing the 3'
lysozyme domain from exon 3 to the 3' end of the 3' MAR was gel purified by
electroelution. The 15.4 kb band from pAVIJCR-A115.93.1.2 and the 6 kb band from
pIIIilys were ligated with T4 DNA ligase and transformed into STBL4 cells (Invitrogen Life
Technologies, Carlsbad, CA) by electroporation. The resulting 21.3 kb plasmids from two
different bacterial colonies were named pAVIJCR-A212.89.2.1 and pAVIJCR-A212.89.2.3
respectively.



6.17 Example 17: Construction of an ALV-based Vector Having β-lactamase
Encoding Sequences

The lacZ gene of pNLB, a replication-deficient avian leukosis virus (ALV)-based
vector (Cosset et al., J. Virol. 65: 3388-94 (1991)), was replaced with an expression cassette
consisting of a cytomegalovirus (CMV) promoter and the reporter gene β-lactamase (β-La
or BL).

To efficiently replace the lacZ gene of pNLB with a transgene, an intermediate
adaptor plasmid, pNLB-Adapter, was first created. pNLB-Adapter was created by inserting
the chewed-back ApaI/ApaI fragment of pNLB (Cosset et al., 1991, J. Virol. 65: 3388-94)
(in pNLB, the 5' ApaI sites reside 289 bp upstream of lacZ and the 3' ApaI sites reside 3' of
the 3' LTR and Gag segments) into the chewed-back KpnI/SacI sites of pBluescript® KS(-).
The filled-in MluI/XbaI fragment of pCMV-BL (Moore et al., Anal. Biochem. 247: 203-9
(1997)) was inserted into the chewed-back KpnI/NdeI sites of pNLB-Adapter, replacing
lacZ with the CMV promoter and the BL gene (in pNLB, KpnI resides 67 bp upstream of
lacZ and NdeI resides 100 bp upstream of the lacZ stop codon), thereby creating
pNLB-Adapter-CMV-BL. To create pNLB-CMV-BL, the HindIII/BlpI insert of pNLB (containing
lacZ) was replaced with the HindIII/BlpI insert of pNLB-Adapter-CMV-BL. This two-step
cloning was necessary because direct ligation of blunt-ended fragments into the HindIII/BlpI
sites of pNLB yielded mostly rearranged subclones, for unknown reasons.

6.18 Example 18: Production of Transduction Particles Having an ALV-based
Vector Having β-lactamase Encoding Sequences

Sentas and Isoldes were cultured in F10 (GIBCO®), 5% newborn calf serum
(GIBCO®), 1% chicken serum (GIBCO®), 50 µg/ml phleomycin (Cayla Laboratories) and
50 µg/ml hygromycin (SIGMA®). Transduction particles were produced as described in
Cosset et al., 1991, herein incorporated by reference, with the following exceptions. Two
days after transfection of the retroviral vector pNLB-CMV-BL (from Example 10, above)
into 9 x 10^5 Sentas, virus was harvested in fresh media for 6-16 hours and filtered. All of
the media was used to transduce 3 x 10^6 Isoldes in three 100 mm plates with polybrene
added to a final concentration of 4 µg/ml. The following day the media was replaced with
media containing 50 µg/ml phleomycin, 50 µg/ml hygromycin and 200 µg/ml G418
(SIGMA®). After 10-12 days, single G418-resistant colonies were isolated and transferred to
24-well plates. After 7-10 days, the titer from each colony was determined by transduction of
Sentas followed by G418 selection. Typically 2 out of 60 colonies gave titers of 1-3 x 10^5.
Those colonies were expanded and the virus concentrated to 2-7 x 10^7 as described in
Allioli et al., 1994, Dev. Biol. 165:30-7, herein incorporated by reference. The integrity of
the CMV-BL expression cassette was confirmed by assaying for β-lactamase in the media of
cells transduced with NLB-CMV-BL transduction particles.

6.19 Example 19: pNLB-CMV-IFN Vector Having an IFN Encoding
Sequence

The DNA sequence for human interferon α2b based on hen oviduct optimized codon
usage was created using the BACKTRANSLATE program of the Wisconsin Package,
version 9.1 (Genetics Computer Group, Inc., Madison, WI) with a codon usage table
compiled from the chicken (Gallus gallus) ovalbumin, lysozyme, ovomucoid, and
ovotransferrin proteins. The template and primer oligonucleotides (SEQ ID NOS: 14-31)
shown in Figures 8A-B were amplified by PCR with Pfu polymerase (STRATAGENE®, La
Jolla, CA) using 20 cycles of 94°C for 1 min., 50°C for 30 sec., and 72°C for 1 min. and 10
sec.
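
The idea of back-translating a protein with a tissue-specific codon usage table can be sketched as follows. This is an illustration only: the source does not state which selection rule BACKTRANSLATE applied, and the codon counts below are hypothetical placeholders, not the table compiled from the ovalbumin, lysozyme, ovomucoid, and ovotransferrin genes. The sketch assumes the simplest rule, choosing the most frequently used codon for each residue.

```python
# Hypothetical codon-usage counts for a few residues, for illustration only.
# These are NOT the values compiled from the chicken egg-white protein genes.
CODON_COUNTS = {
    "M": {"ATG": 10},
    "A": {"GCT": 12, "GCC": 20, "GCA": 9, "GCG": 3},
    "L": {"CTG": 25, "CTC": 10, "TTG": 6, "CTT": 5, "TTA": 2, "CTA": 2},
    "K": {"AAG": 18, "AAA": 9},
    "*": {"TGA": 3, "TAA": 5, "TAG": 1},
}

def back_translate(protein, counts=CODON_COUNTS):
    """Replace each residue with the most frequently used codon in `counts`."""
    dna = []
    for residue in protein:
        usage = counts[residue]  # raises KeyError if the toy table lacks the residue
        dna.append(max(usage, key=usage.get))
    return "".join(dna)

# Toy peptide; '*' denotes the stop codon.
optimized = back_translate("MALK*")
print(optimized)
```

A real table would cover all twenty amino acids, and a designer might also avoid unwanted restriction sites or repeats, which this sketch does not attempt.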

PCR products were purified from a 12% polyacrylamide-TBE gel by the "crush and
soak" method (Maniatis et al., 1982), then combined as templates in an amplification
reaction using only IFN-1 (SEQ ID NO: 21) and IFN-8 (SEQ ID NO: 31) as primers. The
resulting PCR product was digested with Hind III and Xba I and gel purified from a 2%
agarose-TAE gel, then ligated into Hind III and Xba I digested, alkaline phosphatase-treated
pBluescript® KS (STRATAGENE®), resulting in the plasmid pBluKSP-IFNMagMax.
Both strands were sequenced by cycle sequencing on an ABI PRISM 377 DNA Sequencer
(Perkin-Elmer, Foster City, CA) using universal T7 or T3 primers. Mutations in
pBluKSP-IFN derived from the original oligonucleotide templates were corrected by site-directed
mutagenesis with the Transformer Site-Directed Mutagenesis Kit (Clontech, Palo Alto,
CA). The interferon coding sequence was then removed from the corrected pBluKSP-IFN
with Hind III and Xba I, purified from a 0.8% agarose-TAE gel, and ligated to Hind III and
Xba I digested, alkaline phosphatase-treated pCMV-BetaLa-3B-dH. The resulting plasmid
was pCMV-IFN, which contained the IFN coding sequence controlled by the cytomegalovirus
immediate early promoter/enhancer and SV40 polyA site.

To clone the IFN coding sequence controlled by the CMV promoter/enhancer into
the NLB retroviral plasmid, pCMV-IFN was first digested with Cla I and Xba I, then both
ends were filled in with Klenow fragment of DNA polymerase (New England BioLabs,
Beverly, MA). pNLB-adapter was digested with Nde I and Kpn I, and both ends were made
blunt by T4 DNA polymerase (New England BioLabs). Appropriate DNA fragments were
purified on a 0.8% agarose-TAE gel, then ligated and transformed into DH5α cells. The
resulting plasmid was pNLB-adapter-CMV-IFN.

This plasmid was then digested with Mlu I and partially digested with Blp I and the
appropriate fragment was gel purified. pNLB-CMV-EGFP was digested with Mlu I and Blp
I, then alkaline-phosphatase treated and gel purified. The Mlu I/Blp I partial fragment of
pNLB-adapter-CMV-IFN was ligated to the large fragment derived from the Mlu I/Blp I
digest of pNLB-CMV-EGFP, creating pNLB-CMV-IFN.

6.20 Example 20: Production of pNLB-CMV-IFN Transduction Particles 

Senta packaging cells (Cosset et al., 1991) were plated at a density of 3 x 10^5
cells/35 mm tissue culture dish in F-10 medium (Life Technologies) supplemented with
50% calf serum (Atlanta Biologicals), 1% chicken serum (Life Technologies), 50 µg/ml
hygromycin (SIGMA®), and 50 µg/ml phleomycin (CAYLA, Toulouse, France). These
cells were transfected 24 h after plating with 2 µg of CsCl-purified pNLB-CMV-IFN DNA
and 6 µl of Lipofectin liposomes (Life Technologies) in a final volume of 500 µl Optimem
(Life Technologies). The plates were gently rocked for four hours at 37°C in a 5% CO2
incubator. For each well, the medium was removed, and the cells were washed once with
1 ml of Optimem and re-fed with 2 ml of F-10 medium supplemented with 50% calf serum,
1% chicken serum, 50 µg/ml hygromycin, and 50 µg/ml phleomycin. The next day, medium
from transfected Sentas was recovered and filtered through a 0.45 micron filter.

This medium was then used to transduce Isolde cells. 0.3 ml of the filtered medium
recovered from Senta cells was added to 9.6 ml of F-10 (Life Technologies) supplemented
as described above, in addition to polybrene (SIGMA®) at a final concentration of 4 µg/ml.
This mixture was added to 10^6 Isolde packaging cells (Cosset et al., 1991) plated on a
100 mm dish the previous day, then replaced with fresh F-10 medium (as described for Senta
growth) 4 hours later.

The next day, the medium was replaced with fresh medium which also contained
200 µg/ml neomycin (G418, SIGMA®). Every other day, the medium was replaced with
fresh F-10 medium supplemented with 50% calf serum, 1% chicken serum, 50 µg/ml
hygromycin, 50 µg/ml phleomycin, and 200 µg/ml neomycin. Eleven to twelve days later,
single colonies were visible by eye, and these were picked and placed into 24-well dishes.
When some of the 24-well dishes became confluent, medium was harvested and titered to
determine the cell lines with the highest production of retrovirus.

Titering was performed by plating 7.5 x 10^4 Senta cells per well in 24-well plates on
the day prior to viral harvest and transduction. The next day 1 ml of fresh F-10 medium
supplemented with 50% calf serum, 1% chicken serum, 50 µg/ml hygromycin, and 50
µg/ml phleomycin was added to each well of the isolated Isolde colonies. Virus was
harvested for 8-10 hours. The relative density of each well of Isoldes was noted. After 8-10
hours, 2 and 20 µl of media from each well of Isoldes were added directly to the media of
duplicate wells of the Sentas. Harvested medium was also tested for the presence of
interferon by IFN ELISA and for interferon bioreactivity. The next day the media was
replaced with F-10 medium supplemented with 50% calf serum, 1% chicken serum, 50
µg/ml hygromycin, 50 µg/ml phleomycin, and 200 µg/ml neomycin. When obvious
neomycin-resistant colonies were evident in the wells of transduced Sentas, the number of
colonies was counted for each well.

The Isolde colony producing the highest titer was determined by taking into account
the number of colonies and correcting for the density of the Isolde cells when the viral
particles were harvested (i.e., if two Isolde colonies gave rise to media with the same titer,
but one was at a 5% density and the other was at a 50% density at the time of viral harvest,
the one at the 5% density was chosen for further work, as was the case in the present
example).
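
The selection rule described above can be illustrated with a small calculation. The source gives no explicit formula, so the following is a sketch under stated assumptions: titer is taken as resistant colonies per ml of harvested medium, and the density correction is assumed to be a simple division by the fractional confluence of the Isolde well at harvest. The function names and the colony counts are hypothetical.

```python
def titer_per_ml(colonies, volume_ul):
    """Colony-forming units per ml, from colonies counted after adding volume_ul of medium."""
    if volume_ul <= 0:
        raise ValueError("volume_ul must be positive")
    return colonies / (volume_ul / 1000.0)

def density_corrected(titer, confluence):
    """Titer normalized to producer-cell density (confluence as a fraction in (0, 1]).

    Dividing by confluence is an assumed correction, not one stated in the source.
    """
    if not 0 < confluence <= 1:
        raise ValueError("confluence must be in (0, 1]")
    return titer / confluence

# Hypothetical counts: two Isolde colonies each yield 40 resistant Senta
# colonies from 20 microliters of harvested medium, i.e. the same raw titer.
raw_titer = titer_per_ml(40, 20)
sparse_well = density_corrected(raw_titer, 0.05)  # well at 5% density
dense_well = density_corrected(raw_titer, 0.50)   # well at 50% density

print(raw_titer, sparse_well, dense_well)
```

With equal raw titers, the well at 5% density scores ten times higher per unit of cell density than the well at 50% density, which matches the choice described in the paragraph above.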

The Isolde cell line producing the highest titer of IFN-encoding transducing particles
was scaled up to six T-75 tissue culture flasks. When flasks were confluent, cells were
washed with F-10 medium (unsupplemented) and transducing particles were then harvested
for 16 hours in 14 ml/flask of F-10 containing 1% calf serum (Atlanta Biologicals) and 0.2%
chicken serum (Life Technologies). Medium was harvested, filtered through a 0.45 micron
syringe filter, then centrifuged at 195,000 x g in a Beckman 60Ti rotor for 35 min. Liquid
was removed except for 1 ml, and this was incubated with the pellet at 37°C with gentle
shaking for one hour. Aliquots were frozen at -70°C. Transducing particles were then
titered on Senta cells to determine concentrations used to inject avian sperm.



6.21 Example 21: Construction of Lysozyme Promoter Plasmids

The chicken lysozyme gene expression control region isolated by PCR amplification
is fully disclosed in U.S. Patent Application Serial No. 09/922,549, filed August 3, 2001
and incorporated herein by reference in its entirety. Ligation and reamplification of the
fragments thereby obtained yielded a functionally contiguous nucleic acid construct
comprising the chicken lysozyme gene expression control region operably linked to a
nucleic acid sequence encoding a human interferon α2b polypeptide and optimized for
codon usage in the chicken. Briefly, chicken (Gallus gallus (White Leghorn)) genomic
DNA was PCR amplified using the primers 5pLMAR2 and LE-6.1kbrev1 in a first
reaction, and Lys-6.1 and LysE1rev as primers in a second reaction. PCR cycling steps
were: denaturation at 94°C for 1 minute; annealing at 60°C for 1 minute; extension at 72°C
for 6 minutes, for 30 cycles using TAQ PLUS PRECISION™ DNA polymerase
(STRATAGENE®, La Jolla, CA). The PCR products from these two reactions were gel
purified, and then united in a third PCR reaction using only 5pLMAR2 and LysE1rev as
primers and a 10-minute extension period. The resulting DNA product was phosphorylated,
gel-purified, and cloned into the EcoR V restriction site of the vector pBLUESCRIPT® KS,
resulting in the plasmid p12.0-lys.

p12.0-lys was used as a template in a PCR reaction with primers 5pLMAR2 and
LYSBSU and a 10-minute extension time. The resulting DNA was phosphorylated,
gel-purified, and cloned into the EcoR V restriction site of pBLUESCRIPT® KS, forming plasmid
p12.0lys-B.

p12.0lys-B was restriction digested with Not I and Bsu36 I, gel-purified, and cloned
into Not I and Bsu36 I digested pCMV-LysSPIFNMM, resulting in p12.0-lys-LSPIFNMM.
p12.0-lys-LSPIFNMM was digested with Sal I and the SalItoNotI primer was annealed to
the digested plasmid, followed by Not I digestion. The resulting 12.5 kb Not I fragment,
comprising the lysozyme promoter region linked to the IFNMAGMAX-encoding region and an
SV40 polyadenylation signal sequence, was gel-purified and ligated to Not I cleaved and
dephosphorylated pBLUESCRIPT® KS, thereby forming the plasmid pAVIJCR-A115.93.1.2.

6.22 Example 22: Complete Lysozyme Promoter and IFNMAGMAX
Sequences

The complete sequences of the lysozyme gene promoter and the codon-optimized
human interferon α2b nucleic acid are fully disclosed in U.S. Patent Application No.
09/922,549, filed 03 August 2001 and incorporated herein by reference in its entirety. The
complete nucleotide sequence of the approximately 12.5 kb chicken lysozyme promoter
region/IFNMAGMAX construct spans the 5' matrix attachment region (5' MAR), through
the lysozyme signal peptide, to the sequence encoding the gene IFNMAGMAX and the
subsequent polyadenylation signal sequence. The IFNMAGMAX nucleic acid sequence
had been synthesized as described in Example 21 above. The expressed IFN α2b sequence
within plasmid pAVIJCR-A115.93.1.2 functioned as a reporter gene for lysozyme promoter
activity. This plasmid construct may also be used for production of interferon α2b in the
egg white of transgenic chickens.

6.23 Example 23: Synthesis of the MDOT promoter construct

Amplification of the ovomucoid and ovotransferrin promoter sequences 

Oligonucleotide primers 1 (SEQ ID NO: 32) and 2 (SEQ ID NO: 33), as shown in
Figure 9, were used to amplify the ovomucoid sequences. Oligonucleotide primers 3 (SEQ
ID NO: 34) and 4 (SEQ ID NO: 35) were used to amplify the ovotransferrin sequence by
PCR. The primers were designed such that the PCR-amplified ovomucoid sequences
contained an Xho I restriction cleavage site at the 5' end and a Cla I site at the 3' end.
Similarly, the PCR-amplified ovotransferrin product had a Cla I restriction site at the 5' end
and a Hind III site at the 3' end. The overlapping Cla I site was used to splice the two PCR
products to create the MDOT promoter construct. The nucleic acid sequence SEQ ID NO:
11 of the MDOT promoter construct is shown in Figure 11. The final product was cloned in
a bluescript vector between the Xho I and Hind III sites. From the bluescript vector the
promoter region was released by Kpn I/Hind III restriction digestion and cloned into the
prc-CMV-IFN vector to replace the CMV promoter to create MDOT-IFN (clone #10). This
plasmid was tested in vitro.
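
The splicing of two amplified fragments at a shared Cla I site can be sketched at the sequence level. This is a simplified illustration, not the disclosed procedure: it treats digestion and ligation as plain string joining and ignores overhang chemistry, and the two insert sequences are hypothetical placeholders rather than the ovomucoid or ovotransferrin promoter sequences. Only the recognition sequences (Xho I CTCGAG, Cla I ATCGAT, Hind III AAGCTT) are standard.

```python
# Standard recognition sequences for the three enzymes named in the text.
XHO_I, CLA_I, HIND_III = "CTCGAG", "ATCGAT", "AAGCTT"

def splice_at_shared_site(upstream, downstream, site=CLA_I):
    """Join two fragments at a site that ends the first and begins the second,
    keeping a single copy of the site at the junction."""
    if not upstream.endswith(site) or not downstream.startswith(site):
        raise ValueError("both fragments must carry the shared site at the junction")
    return upstream + downstream[len(site):]

# Placeholder inserts (hypothetical; NOT the real promoter sequences).
ovomucoid_fragment = XHO_I + "GGGTTTAAACCC" + CLA_I
ovotransferrin_fragment = CLA_I + "TTTGGGCCCAAA" + HIND_III

mdot_like = splice_at_shared_site(ovomucoid_fragment, ovotransferrin_fragment)
print(mdot_like)
```

The joined product begins with the Xho I site, ends with the Hind III site, and carries one Cla I site at the junction, mirroring the layout used to clone the construct between the Xho I and Hind III sites of the vector.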

6.24 Example 24: Testicular Injection 

Five-week-old White Leghorn male chickens were anesthetized using isoflurane.
A small incision was made between the last two ribs to expose the testes. A 5-10 µl virus
suspension of pLNHX-CMV-EGFP/VSVg (9 x 10^6 per ml) was injected into either both
testes or only one of the testes.

At 20 weeks of age, semen samples were collected. Only one bird had sperm in his
semen. Genomic DNA was isolated from the semen and used to amplify the transgene
(CMV-EGFP) by PCR using different DMSO concentrations. The samples were
separated on an agarose gel, transferred onto a nitrocellulose membrane and hybridized with
an EGFP probe. As shown in Figure 11, EGFP-positive bands were detected at two different
DMSO concentrations, suggesting that (1) specific PCR conditions are required for the
amplification of the transgene and (2) the sperm samples have incorporated the transgene in
their genome.

EQUIVALENTS 

Reference now will be made in detail to the various embodiments of the invention,
one or more examples of which are illustrated in the accompanying drawings. Each
example is provided by way of explanation of the invention, not limitation of the invention.
In fact, it will be apparent to those skilled in the art that various modifications,
combinations, additions, deletions and variations can be made in the present invention
without departing from the scope or spirit of the invention. For instance, features illustrated
or described as part of one embodiment can be used in another embodiment to yield a still
further embodiment. It is intended that the present invention covers such modifications,
combinations, additions, deletions and variations as fall within the scope of the appended
claims and their equivalents.

What is Claimed Is:

1. A method of generating a transgenic avian zygote by sperm-mediated transfection,
said method comprising:

(a) obtaining a suspension of avian male germ cells selected from the group
consisting of spermatozoa and spermatozoal precursor cells;

(b) introducing a nucleic acid comprising a transgene comprising a nucleotide
sequence encoding a heterologous polypeptide to the avian male germ cells
by lipofection, electroporation or restriction enzyme mediated integration;

(c) delivering the avian male germ cells having the nucleic acid to an avian
oocyte,

thereby generating a transgenic avian zygote having the nucleic acid incorporated therein.

2. The method of Claim 1, wherein the avian male germ cells and the avian oocyte are
obtained from a chicken.

3. The method of Claim 1, wherein the avian male germ cells and the avian oocytes are
obtained from a quail.

4. The method of Claim 1, wherein the nucleotide sequence encoding said
heterologous polypeptide is operably linked to a transcriptional regulatory element that can
direct gene expression in one or more cells of said transgenic avian.

5. The method of Claim 4, wherein the transcriptional regulatory element is selected
from the group consisting of the promoter regions of the avian genes encoding ovalbumin,
lysozyme, ovomucoid, ovomucin, conalbumin and ovotransferrin.

6. The method of Claim 5, wherein the selected nucleic acid further comprises a
chicken lysozyme gene expression controlling region comprising the nucleotide sequence of
SEQ ID NO: 7.

7. The method of Claim 4, wherein the transcriptional regulatory element is a tissue
specific promoter.

8. The method of Claim 7, wherein the tissue specific promoter is specific for the
magnum.

9. The method of Claim 1, wherein the transgene comprises at least one
cytomegalovirus promoter.

10. The method of Claim 9, wherein the transcriptional regulatory element comprises at
least two regions derived from the promoter of an avian gene, said regions being from a
different promoter.

11. The method of Claim 10, wherein the transcriptional regulatory element has the
nucleotide sequence of SEQ ID NO: 11.

12. The method of Claim 1, wherein the transgene comprises at least one matrix
attachment region (MAR).

13. The method of Claim 12, wherein the transgene comprises a 5' MAR and a 3' MAR
which flank said nucleotide sequence.

14. The method of Claim 1, wherein the heterologous polypeptide is selected from the
group consisting of a cytokine, a hormone, an enzyme, a structural polypeptide, and an
immunoglobulin polypeptide.

15. The method of Claim 14, wherein the cytokine is selected from the group consisting
of interferon, interleukin, granulocyte colony-stimulating factor, granulocyte-macrophage
colony-stimulating factor, stem cell factor, erythropoietin, thrombopoietin, and stem cell
factor.

16. The method of Claim 15, wherein the cytokine is an interferon.

17. The method of Claim 1, wherein the transgene comprises an internal ribosome entry
site (IRES).

18. The method of Claim 17, wherein the transgene comprises at least two nucleotide
sequences each encoding a heterologous polypeptide.

19. The method of Claim 18, wherein the at least two nucleotide sequences encode at
least two heterologous peptides that form a multimeric protein.

20. The method of Claim 19, wherein the multimeric protein specifically binds a
selected ligand.

21. The method of Claim 20, wherein the multimeric protein is an antibody.

22. The method of Claim 1, wherein the heterologous polypeptide comprises a peptide
region suitable for the isolation of the heterologous polypeptide.

23. The method of Claim 1, wherein the nucleic acid is a eukaryotic viral vector.

24. The method of Claim 23, wherein the eukaryotic viral vector is derived from any of
the group consisting of avian leukosis virus, adenovirus, transferrin-polylysine enhanced
adenoviral vectors, human immunodeficiency virus vectors, lentiviral vectors, and Moloney
murine leukemia virus-derived vectors.

25. The method of Claim 1, wherein the nucleic acid is a plasmid vector.

26. The method of Claim 1, wherein the nucleic acid is a bacterial artificial chromosome
(BAC).

27. The method of Claim 1, wherein the nucleic acid is not a eukaryotic viral vector.

28. The method of Claim 4, wherein the transcriptional regulatory element is a
regulatable promoter.

29. The method of Claim 6, wherein the selected nucleic acid further comprises a region
encoding the 3' region of the chicken lysozyme gene and having the nucleotide sequence of
SEQ ID NO: 9.

30. The method of Claim 1, wherein the nucleotide sequence encoding said
heterologous polypeptide comprises an origin of replication.

31. The method of Claim 30, wherein the origin of replication is the SV40 origin of
replication.

32. The method of Claim 1, wherein the nucleic acid is selected from the group
consisting of a linear nucleic acid, a plasmid, a viral nucleic acid, and an artificial
chromosome.

33. The method of Claim 32, wherein the artificial chromosome further comprises a
centromere and optionally a telomere.

34. The method of Claim 32, wherein the linear nucleic acid has at least one cohesive
end characterized by the cohesive end generated by a restriction endonuclease.

35. The method of Claim 32, wherein the linear nucleic acid has at least one blunt end.

36. The method of Claim 34, wherein the at least one cohesive end is generated by
chemical synthesis.

37. The method of Claim 34, wherein the at least one cohesive end is generated by an
enzyme other than a restriction endonuclease.

38. The method of Claim 34, wherein the at least one cohesive end is generated by a
combination of chemical and enzymatic methods.

39. The method of Claim 1, wherein the nucleic acid is introduced to the avian male
germ cells by restriction enzyme mediated integration.

40. The method of Claim 39, further comprising the step of delivering to the avian male
germ cells a restriction endonuclease capable of cleaving the genomic nucleic acid of the
avian male germ cells.

41. The method of Claim 40, wherein the nucleic acid is delivered sequentially with the
restriction endonuclease to the avian male germ cells.

42. The method of Claim 1, wherein the nucleic acid is delivered to the avian male germ
cells by adeno-associated virus-derived vector.

43. The method of Claim 42, wherein the nucleic acid is bounded by inverted terminal
repeat sequences.

44. The method of Claim 42, wherein the nucleic acid is bounded by inverted terminal
repeat sequences derived from an adeno-associated virus-derived vector.

45. The method of Claim 42, wherein the adeno-associated virus-derived vector further
comprises a transcription cassette capable of expressing an adeno-associated virus Rep
protein.

46. The method of Claim 45, wherein the Rep protein is Rep 78.

47. The method of Claim 45, wherein the nucleic acid bounded by inverted terminal
repeat sequences is inserted in a first nucleic acid vector and the transcription cassette
capable of expressing an adeno-associated virus Rep protein is inserted in a second nucleic
acid vector.

48. The method of Claim 1, further comprising the step of irradiating the avian male
germ cells, thereby cleaving the nucleic acid, wherein the radiation is selected from the group
consisting of ultraviolet light, gamma rays, X-rays, and ultrasound.

49. The method of Claim 1, wherein the avian oocyte is an isolated oocyte, and wherein
the avian male germ cells having the nucleic acid are delivered to the isolated oocyte by a
method selected from the group consisting of microinjection, intracytoplasmic sperm
injection (ICSI), and artificial insemination.

50. The method of Claim 49, wherein the avian male germ cells having the nucleic acid
therein are delivered to the nucleus of the oocyte.

51. The method of Claim 1, wherein the nucleic acid forms an episome in the avian
male germ cells.

52. The method of Claim 1, wherein the nucleic acid in the avian oocyte is an episome.

53. The method of Claim 1, further comprising isolating an avian oocyte from the
female of an avian by:

(a) removing an ovum from a bird after ovulation and before fertilization; and

(b) removing an albumen layer from the ovum.

54. The method of Claim 1, further comprising the steps of:

(a) fistulating an avian female;

(b) delivering the transgenic avian zygote to the infundibulum of the avian
female such that said transgenic avian zygote is subsequently laid by said
avian female as a shelled egg; and

(c) incubating the shelled egg until said shelled egg hatches,
thereby producing a transgenic avian containing the transgene.



55. The method of Claim 54, wherein the heterologous polypeptide is expressed in one
or more cells of said transgenic avian.

56. The method of Claim 55, wherein the heterologous polypeptide is expressed in the
serum of said transgenic avian.

57. The method of Claim 55, wherein the heterologous polypeptide is expressed in the
magnum of said transgenic avian.

58. The method of Claim 54, further comprising the step of allowing the transgenic
avian to develop to sexual maturity.

59. The method of Claim 58, wherein the heterologous polypeptide is delivered to the
white of a developing avian egg produced by the transgenic avian.

60. The method of Claim 55 or 59 further comprising isolating said heterologous
polypeptide from said transgenic avian or an egg produced by the transgenic avian.

61 . A transgenic avian that produces at least one heterologous polypeptide in egg white, 
wherein the transgenic avian or founder ancestor of said transgenic avian was not produced 
35 using a eukaryotic viral vector. 
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62. A transgenic avian produced by the method of Claim 54. 

63. The transgenic avian of Claim 61 or 62, wherein the avian is a chicken. 

5 64. The transgenic avian of Claim 63, wherein the heterologous polypeptide is selected 
from the group consisting of a cytokine, a hormone, an enzyme, a structural protein, and an 
irnmunglobulin polypeptide. 

65. The transgenic avian of Claim 63, wherein the cytokine is an interferon. 

10 

66. The transgenic avian of Claim 6 1 or 62, wherein the transgenic avian produces a 
heterologous multimeric protein. 

67. The transgenic avian of Claim 66, wherein the heterologous multimeric protein 
1 5 specifically binds a selected ligand. 

68. The transgenic avian of Claim 66, wherein the heterologous multimeric protein is an 
antibody. 

69. An avian egg produced by the transgenic avian of Claim 61 or 62.

70. An avian egg produced by the transgenic avian of any of Claims 63-68. 

71. A heterologous protein produced by the transgenic avian of
Claim 61 or 62, wherein the heterologous protein comprises a heterologous polypeptide
selected from the group consisting of a cytokine, a hormone, an enzyme, a structural
protein, and an immunoglobulin polypeptide.

72. The heterologous polypeptide of Claim 71, wherein the cytokine is an interferon.


73. The heterologous protein of Claim 71, wherein the heterologous protein is a
multimeric protein. 

74. The heterologous protein of Claim 71, wherein the heterologous protein is an
antibody.

SEQ ID NO: 6

TGCCGCCTTC TTTGATATTC ACTCTGTTGT ATTTCATCTC TTCTTGCCGA TGAAAGGATA 60
TAACAGTCTG TATAACAGTC TGTGAGGAAA TACTTGGTAT TTCTTCTGAT CAGTGTTTTT 120
ATAAGTAATG TTGAATATTG GATAAGGCTG TGTGTCCTTT GTCTTGGGAG ACAAAGCCCA 180
CAGCAGGTGG TGGTTGGGGT GGTGGCAGCT CAGTGACAGG AGAGGTTTTT TTGCCTGTTT 240
TTTTTTTTTT TTTTTTTTTT AAGTAAGGTG TTCTTTTTTC TTAGTAAATT TTCTACTGGA 300
CTGTATGTTT TGACAGGTCA GAAACATTTC TTCAAAAGAA GAACCTTTTG GAAACTGTAC 360
AGCCCTTTTC TTTCATTCCC TTTTTGCTTT CTGTGCCAAT GCCTTTGGTT CTGATTGCAT 420
TATGGAAAAC GTTGATCGGA ACTTGAGGTT TTTATTTATA GTGTGGCTTG AAAGCTTGGA 480
TAGCTGTTGT TACACGAGAT ACCTTATTAA GTTTAGGCCA GCTTGATGCT TTATTTTTTC 540
CCTTTGAAGT AGTGAGCGTT CTCTGGTTTT TTTCCTTTGA AACTGGTGAG GCTTAGATTT 600
TTCTAATGGG ATTTTTTACC TGATGATCTA GTTGCATACC CAAATGCTTG TAAATGTTTT 660
CCTAGTTAAC ATGTTGATAA CTTCGGATTT ACATGTTGTA TATACTTGTC ATCTGTGTTT 720
CTAGTAAAAA TATATGGCAT TTATAGAAAT ACGTAATTCC TGATTTCCTT TTTTTTTATC 780
TCTATGCTCT GTGTGTACAG GTCAAACAGA CTTCACTCCT ATTTTTATTT ATAGAATTTT 840
ATATGCAGTC TGTCGTTGGT TCTTGTGTTG TAAGGATACA GCCTTAAATT TCCTAGAGCG 900
ATGCTCAGTA AGGCGGGTTG TCACATGGGT TCAAATGTAA AACGGGCACG TTTGGCTGCT 960
GCCTTCCCGA GATCCAGGAC ACTAAACTGC TTCTGCACTG AGGTATAAAT CGCTTCAGAT 1020
CCCAGGGAAG TGCAGATCCA CGTGCATATT CTTAAAGAAG AATGAATACT TTCTAAAATA 1080
TTTTGGCATA GGAAGCAAGC TGCATGGATT TGTTTGGGAC TTAAATTATT TTGGTAACGG 1140
AGTGCATAGG TTTTAAACAC AGTTGCAGCA TGCTAACGAG TCACAGCGTT TATGCAGAAG 1200
TGATGCCTGG ATGCCTGTTG CAGCTGTTTA CGGCACTGCC TTGCAGTGAG CATTGCAGAT 1260
AGGGGTGGGG TGCTTTGTGT CGTGTTCCCA CACGCTGCCA CACAGCCACC TCCCGGAACA 1320
CATCTCACCT GCTGGGTACT TTTCAAACCA TCTTAGCAGT AGTAGATGAG TTACTATGAA 1380
ACAGAGAAGT TCCTCAGTTG GATATTCTCA TGGGATGTCT TTTTTCCCAT GTTGGGCAAA 1440
GTATGATAAA GCATCTCTAT TTGTAAATTA TGCACTTGTT AGTTCCTGAA TCCTTTCTAT 1500
AGCACCACTT ATTGCAGCAG GTGTAGGCTC TGGTGTGGCC TGTGTCTGTG CTTCAATCTT 1560
TTAAAGCTTC TTTGGAAATA CACTGACTTG ATTGAAGTCT CTTGAAGATA GTAAACAGTA 1620
CTTACCTTTG ATCCCAATGA AATCGAGCAT TTCAGTTGTA AAAGAATTCC GCCTATTCAT 1680
ACCATGTAAT GTAATTTTAC ACCCCCAGTG CTGACACTTT GGAATATATT CAAGTAATAG 1740
ACTTTGGCCT CACCCTCTTG TGTACTGTAT TTTGTAATAG AAAATATTTT AAACTGTGCA 1800
TATGATTATT ACATTATGAA AGAGACATTC TGCTGATCTT CAAATGTAAG AAAATGAGGA 1860
GTGCGTGTGC TTTTATAAAT ACAAGTGATT GCAAATTAGT GCAGGTGTCC TTAAAAAAAA 1920
AAAAAAAAAG TAATATAAAA AGGACCAGGT GTTTTACAAG TGAAATACAT TCCTATTTGG 1980
TAAACAGTTA CATTTTTATG AAGATTACCA GCGCTGCTGA CTTTCTAAAC ATAAGGCTGT 2040
SiSSSSS ^ACCaSg CATTTCCTCA TTCCCAATTT GCA^GGAT *£TG^TAA !! 
ACTATTCAAG AAATGGCTTT GAAATACAGC ATGGGAGCTT GTCTGAGTTG GAATGCAGAG 2160 
TTGCACTGCA AAATGTCAGG AAATGGATGT CTCTCAGAAT GCCCAACTCC AAAGGATTTT 2220 
ATATGTGTAT ATAGTAAGCA GTTTCCTGAT TCCAGCAGGC CAAAGAGTCT GCTGAATGTT 2280 
GTGTTGCCGG AGACCTGTAT TTCTCAACAA GGTAAGATGG TATCCTAGCA ACTGOGGATT 2340 
TTAATACATT TTCAGCAGAA GTACTTAGTT AATCTCTACC TTTAGGGATC GTTTCATCAT 2400 
SSSS S SaCTGCATA ACTTTTAGCT "CATGGGTT CCTTTTTTTC 24 
AGCCTTTAGG AGACTGTTAA GCAATTTGCT GTCCAACTTT TGTGTTGGTC TTAAACTGCA 2520 
ATAGTAGTTT ACCTTGTATT GAAGAAATAA AGACCATTTT TATATTAAAA AATACTTTTG 2580 

TCTGTCTTCA TTTTGACTTG TCTGATATCC TTGCAGTGCC CATTATGTCA GTTCTGTCAG 2640
ATATTCAGAC ATCAAAACTT AACGTGAGCT CAGTGGAGTT ACAGCTGCGG TTTTGATGCT 2700
GTTATTATTT CTGAAACTAG AAATGATGTT GTCTTCATCT GCTCATCAAA CACTTCATGC 2760
AGAGTGTAAG GCTAGTGAGA AATGCATACA TTTATTGATA CTTTTTTAAA GTCAACTTTT 2820
TATCAGATTT TTTTTTCATT TGGAAATATA TTGTTTTCTA GACTGCATAG CTTCTGAATC 2880
TGAAATGCAG TCTGATTGGC ATGAAGAAGC ACAGCACTCT TCATCTTACT TAAACTTCAT 2940
TTTGGAATGA AGGAAGTTAA GCAAGGGCAC AGGTCCATGA AATAGAGACA GTGCGCTCAG 3000
GAGAAAGTGA ACCTGGATTT CTTTGGCTAG TGTTCTAAAT CTGTAGTGAG GAAAGTAACA 3060
CCCGATTCCT TGAAAGGGCT CCAGCTTTAA TGCTTCCAAA TTGAAGGTGG CAGGCAACTT 3120
GGCCACTGGT TATTTACTGC ATTATGTCTC AGTTTCGCAG CTAACCTGGC TTCTCCACTA 3180
TTGAGCATGG ACTATAGCCT GGCTTCAGAG GCCAGGTGAA GGTTGGGATG GGTGGAAGGA 3240
GTGCTGGGCT GTGGCTGGGG GGACTGTGGG GACTCCAAGC TGAGCTTGGG GTGGGCAGCA 3300
CAGGGAAAAG TGTGGGTAAC TATTTTTAAG TACTGTGTTG CAAACGTCTC ATCTGCAAAT 3360
ACGTAGGGTG TGTACTCTCG AAGATTAACA GTGTGGGTTC AGTAATATAT GGATGAATTC 3420
ACAGTGGAAG CATTCAAGGG TAGATCATCT AACGACACCA GATCATCAAG CTATGATTGG 3480
AAGCGGTATC AGAAGAGCGA GGAAGGTAAG CAGTCTTCAT ATGTTTTCCC TCCACGTAAA 3540
GCAGTCTGGG AAAGTAGCAC CCCTTGAGCA GAGACAAGGA AATAATTCAG GAGCATGTGC 3600
TAGGAGAACT TTCTTGCTGA ATTCTACTTG CAAGAGCTTT GATGCCTGGC TTCTGGTGCC 3660
TTCTGCAGCA CCTGCAAGGC CCAGAGCCTG TGGTGAGCTG GAGGGAAAGA TTCTGCTCAA 3720
GTCCAAGCTT CAGCAGGTCA TTGTCTTTGC TTCTTCCCCC AGCACTGTGC AGCAGAGTGG 3780
AACTGATGTC GAAGCCTCCT GTCCACTACC TGTTGCTGCA GGCAGACTGC TCTCAGAAAA 3840
AGAGAGCTAA CTGTATGCCA TAGTCTGAAG GTAAAATGGG TTTTAAAAAA GAAAACACAA 3900
AGGCAAAACC GGCTGCCCCA TGAGAAGAAA GCAGTGGTAA ACATGGTAGA AAAGGTGCAG 3960
AAGCCCCCAG GCAGTGTGAC AGGCCCCTCC TGCCACCTAG AGGCGGGAAC AAGCTTCCCT 4020
GCCTAGGGCT CTGCCCGCGA AGTGCGTGTT TCTTTGGTGG GTTTTGTTTG GCGTTTGGTT 4080
TTGAGATTTA GACACAAGGG AAGCCTGAAA GGAGGTGTTG GGCACTATTT TGGTTTGTAA 4140
AGCCTGTACT TCAAATATAT ATTTTGTGAG GGAGTGTAGC GAATTGGCCA ATTTAAAATA 4200
AAGTTGCAAG AGATTGAAGG CTGAGTAGTT GAGAGGGTAA CACGTTTAAT GAGATCTTCT 4260
GAAACTACTG CTTCTAAACA CTTGTTTGAG TGGTGAGACC TTGGATAGGT GAGTGCTCTT 4320
GTTACATGTC TGATGCACTT GCTTGTCCTT TTCCATCCAC ATCCATGCAT TCCACATCCA 4380
CGCATTTGTC ACTTATCCCA TATCTGTCAT ATCTGACATA CCTGTCTCTT CGTCACTTGG 4440
TCAGAAGAAA CAGATGTGAT AATCCCCAGC CGCCCCAAGT TTGAGAAGAT GGCAGTTGCT 4500
TCTTTCCCTT TTTCCTGCTA AGTAAGGATT TTCTCCTGGC TTTGACACCT CACGAAATAG 4560
TCTTCCTGCC TTACATTCTG GGCATTATTT CAAATATCTT TGGAGTGCGC TGCTCTCAAG 4620
TTTGTGTCTT CCTACTCTTA GAGTGAATGC TCTTAGAGTG AAAGAGAAGG AAGAGAAGAT 4680
GTTGGCCGCA GTTCTCTGAT GAACACACCT CTGAATAATG GCCAAAGGTG GGTGGGTTTC 4740
TCTGAGGAAC GGGCAGCGTT TGCCTCTGAA AGCAAGGAGC TCTGCGGAGT TGCAGTTATT 4800
TTGCAACTGA TGGTGGAACT GGTGCTTAAA GCAGATTCCC TAGGTTCCCT GCTACTTCTT 4860
TTCCTTCTTG GCAGTCAGTT TATTTCTGAC AGACAAACAG CCACCCCCAC TGCAGGCTTA 4920
GAAAGTATGT GGCTCTGCCT GGGTGTGTTA CAGCTCTGCC CTGGTGAAAG GGGATTAAAA 4980
CGGGCACCAT TCATCCCAAA CAGGATCCTC ATTCATGGAT CAAGCTGTAA GGAACTTGGG 5040
CTCCAACCTC AAAACATTAA TTGGAGTACG AATGTAATTA AAACTGCATT CTCGCATTCC 5100
TAAGTCATTT AGTCTGGACT CTGCAGCATG TAGGTCGGCA GCTCCCACTT TCTCAAAGAC 5160
CACTGATGGA GGAGTAGTAA AAATGGAGAC CGATTCAGAA CAACCAACGG AGTGTTGCCG 5220
AAGAAACTGA TGGAAATAAT GCATGAATTG TGTGGTGGAC ATTTTTTTTA AATACATAAA 5280
CTACTTCAAA TGAGGTCGGA GAAGGTCAGT GTTTTATTAG CAGCCATAAA ACCAGGTGAG 5340
CGAGTACCAT SSSSSS AAGAAAAACG ATTCTGAGCT CTGCGTAAGT ATAAGTTCTC 5400 
CATAGCGGCT GAAGCTCCCC CCTGGCTGCC TGCCATCTCA GCTGGAGTGC AGTGCCATTT 5460
CCTTGGGGTT TCTCTCACAG CAGTAATGGG ACAATACTTC ACAAAAATTC TTTCTTTTCC 5520
TGTCATGTGG GATCCCTACT GTGCCCTCCT GGTTTTACGT TACCCCCTGA CTGTTCCATT 5580
CAGCGGTTTG GAAAGAGAAA AAGAATTTGG AAATAAAACA TGTCTACGTT ATCACCTCCT 5640
CCAGCATTTT GGTTTTTAAT TATGTCAATA ACTGGCTTAG ATTTGGAAAT GAGAGGGGGT 5700
TGGGTGTATT ACCGAGGAAC AAAGGAAGGC TTATATAAAC TCAAGTCTTT TATTTAGAGA 5760
ACTGGCAAGC TGTCAAAAAC AAAAAGGCCT TACCACCAAA TTAAGTGAAT AGCCGCTATA 5820
GCCAGCAGGG CCAGCACGAG GGATGGTGCA CTGCTGGCAC TATGCCACGG CCTGCTTGTG 5880
ACTCTGAGAG CAACTGCTTT GGAAATGACA GCACTTGGTG CAATTTCCTT TGTTTCAGAA 5940
TGCGTAGAGC GTGTGCTTGG CGACAGTTTT TCTAGTTAGG CCACTTCTTT TTTCCTTCTC 6000
TCCTCATTCT CCTAAGCATG TCTCCATGCT GGTAATCCCA GTCAAGTGAA CGTTCAAACA 6060
ATGAATCCAT CACTGTAGGA TTCTCGTGGT GATCAAATCT TTGTGTGAGG TCTATAAAAT 6120
ATGGAAGCTT ATTTATTTTT CGTTCTTCCA TATCAGTCTT CTCTATGACA ATTCACATCC 6180
ACCACAGCAA ATTAAAGGTG AAGGAGGCTG GTGGGATGAA GAGGGTCTTC TAGCTTTACG 6240
TTCTTCCTTG CAAGGCCACA GGAAAATGCT GAGAGCTGTA GAATACAGCC TGGGGTAAGA 6300
AGTTCAGTCT CCTGCTGGGA CAGCTAACCG CATCTTATAA CCCCTTCTGA GACTCATCTT 6360
AGGACCAAAT AGGGTCTATC TGGGGTTTTT GTTCCTGCTG TTCCTCCTGG AAGGCTATCT 6420
CACTATTTCA CTGCTCCCAC GGTTACAAAC CAAAGATACA GCCTGAATTT TTTCTAGGCC 6480
ACATTACATA AATTTGACCT GGTACCAATA TTGTTCTCTA TATAGTTATT TCCTTCCCCA 6540
CTGTGTTTAA CCCCTTAAGG CATTCAGAAC AACTAGAATC ATAGAATGGT TTGGATTGGA 6600
AGGGGCCTTA AACATCATCC ATTTCCAACC CTCTGCCATG GGCTGCTTGC CACCCACTGG 6660
CTCAGGCTGC CCAGCGCCCC ATCCAGCCTG GCCTTGAGCA CCTCCAGGGA TGGGGCACCC 6720
ACAGCTTCTC TGGGCAGCCT GTGCCAACAC CTCACCACTC TCTGGGTAAA GAATTCTCTT 6780
TTAACATCTA ATCTAAATCT CTTCTCTTTT AGTTTAAAGC CATTCCTCTT TTTCCCGTTG 6840
CTATCTGTCC AAGAAATGTG TATTGGTCTC CCTCCTGCTT ATAAGCAGGA AGTACTGGAA 6900
GGCTGCAGTG AGGTCTCCCC ACAGCCTTCT CTTCTCCAGG CTGAACAAGC CCAGCTCCTT 6960
CAGCCTGTCT TCGTAGGAGA TCATCTTAGT GGCCCTCCTC TGGACCCATT CCAACAGTTC 7020
CACGGCTTTC TTGTGGAGCC CCAGGTCTGG ATGCAGTACT TCAGATGGGG CCTTACAAAG 7080
GCAGAGCAGA TGGGGACAAT CGCTTACCCC TCCCTGCTGG CTGCCCCTGT TTTGATGCAG 7140
CCCAGGGTAC TGTTGGCCTT TCAGGCTCCC AGACCCCTTG CTGATTTGTG TCAAGCTTTT 7200
CATCCACCAG AACCCACGCT TCCTGGTTAA TACTTCTGCC CTCACTTCTG TAAGCTTGTT 7260
TCAGGAGACT TCCATTCTTT AGGACAGACT GTGTTACACC TACCTGCCCT ATTCTTGCAT 7320
ATATACATTT CAGTTCATGT TTCCTGTAAC AGGACAGAAT ATGTATTCCT CTAACAAAAA 7380
TACATGCAGA ATTCCTAGTG CCATCTCAGT AGGGTTTTCA TGGCAGTATT AGCACATAGT 7440
CAATTTGCTG CAAGTACCTT CCAAGCTGCG GCCTCCCATA AATCCTGTAT TTGGGATCAG 7500
TTACCTTTTG GGGTAAGCTT TTGTATCTGC AGAGACCCTG GGGGTTCTGA TGTGCTTCAG 7560
CTCTGCTCTG TTCTGACTGC ACCATTTTCT AGATCACCCA GTTGTTCCTG TACAACTTCC 7620
TTGTCCTCCA TCCTTTCCCA GCTTGTATCT TTGACAAATA CAGGCCTATT TTTGTGTTTG 7680
CTTCAGCAGC CATTTAATTC TTCAGTGTCA TCTTGTTCTG TTGATGCCAC TGGAACAGGA 7740
TTTTCAGCAG TCTTGCAAAG AACATCTAGC TGAAAACTTT CTGCCATTCA ATATTCTTAC 7800
CAGTTCTTCT TGTTTGAGGT GAGCCATAAA TTACTAGAAC TTCGTCACTG ACAAGTTTAT 7860
GCATTTTATT ACTTCTATTA TGTACTTACT TTGACATAAC ACAGACACGC ACATATTTTG 7920
CTGGGATTTC CACAGTGTCT CTGTGTCCTT CACATGGTTT TACTGTCATA CTTCCGTTAT 7980
AACCTTGGCA ATCTGCCCAG CTGCCCATCA CAAGAAAAGA GATTCCTTTT TTATTACTTC 8040



FIG. 1C 



WO 03/024199 PCT/US02/30156




TCTTCAGCCA ATAAACAAAA TGTGAGAAGC CCAAACAAGA ACTTGTGGGG CAGGCTGCCA 8100
TCAAGGGAGA GACAGCTGAA GGGTTGTGTA GCTCAATAGA ATTAAGAAAT AATAAAGCTG 8160 
TGTCAGACAG TTTTGCCTGA TTTATACAGG CACGCCCCAA GCCAGAGAGG CTGTCTGCCA 8220
AGGCCACCTT GCAGTCCTTG GTTTGTAAGA TAAGTCATAG GTAACTTTTC TGGTGAATTG 8280
CGTGGAGAAT CATGATGGCA GTTCTTGCTG TTTACTATGG TAAGATGCTA AAATAGGAGA 8340
CAGCAAAGTA ACACTTGCTG CTGTAGGTGC TCTGCTATCC AGACAGCGAT GGCACTCGCA 8400 
CACCAAGATG AGGGATGCTC CCAGCTGACG GATGCTGGGG CAGTAACAGT GGGTCCCATG 8460
CTGCCTGCTC ATTAGCATCA CCTCAGCCCT CACCAGCCCA TCAGAAGGAT CATCCCAAGC 8520
TGAGGAAAGT TGCTCATCTT CTTCACATCA TCAAACCTTT GGCCTGACTG ATGCCTCCCG 8580
GATGCTTAAA TGTGGTCACT GACATCTTTA TTTTTCTATG ATTTCAAGTC AGAACCTCCG 8640 
GATCAGGAGG GAACACATAG TGGGAATGTA CCCTCAGCTC CAAGGCCAGA TCTTCCTTCA 8700
ATGATCATGC ATGCTACTTA GGAAGGTGTG TGTGTGTGAA TGTAGAATTG CCTTTGTTAT 8760
TTTTTCTTCC TGCTGTCAGG AACATTTTGA ATACCAGAGA AAAAGAAAAG TGCTCTTCTT 8820
GGCATGGGAG GAGTTGTCAC ACTTGCAAAA TAAAGGATGC AGTCCCAAAT GTTCATAATC 8880
TCAGGGTCTG AAGGAGGATC AGAAACTGTG TATACAATTT CAGGCTTCTC TGAATGCAGC 8940
TTTTGAAAGC TGTTCCTGGC CGAGGCAGTA CTAGTCAGAA CCCTCGGAAA CAGGAACAAA 9000 
TGTCTTCAAG GTGCAGCAGG AGGAAACACC TTGCCCATCA TGAAAGTGAA TAACCACTGC 9060 
CGCTGAAGGA ATCCAGCTCC TGTTTGAGCA GGTGCTGCAC ACTCCCACAC TGAAACAACA 9120 
GTTCATTTTT ATAGGACTTC CAGGAAGGAT CTTCTTCTTA AGCTTCTTAA TTATGGTACA 9180
TCTCCAGTTG GCAGATGACT ATGACTACTG ACAGGAGAAT GAGGAACTAG CTGGGAATAT 9240
TTCTGTTTGA CCACCATGGA GTCACCCATT TCTTTACTGG TATTTGGAAA TAATAATTCT 9300
GAATTGCAAA GCAGGAGTTA GCGAAGATCT TCATTTCTTC CATGTTGGTG ACAGCACAGT 9360 
TCTGGCTATG AAAGTCTGCT TACAAGGAAG AGGATAAAAA TCATAGGGAT AATAAATCTA 9420 
AGTTTGAAGA CAATGAGGTT TTAGCTGCAT TTGACATGAA GAAATTGAGA CCTCTACTGG 9480 
ATAGCTATGG TATTTACGTG TCTTTTTGCT TAGTTACTTA TTGACCCCAG CTGAGGTCAA 9540
GTATGAACTC AGGTCTCTCG GGCTACTGGC ATGGATTGAT TACATACAAC TGTAATTTTA 9600
GCAGTGATTT AGGGTTTATG AGTACTTTTG CAGTAAATCA TAGGGTTAGT AATGTTAATC 9660
TCAGGGAAAA AAAAAAAAAG CCAACCCTGA CAGACATCCC AGCTCAGGTG GAAATCAAGG 9720 
ATCACAGCTC AGTGCGGTCC CAGAGAACAC AGGGACTCTT CTCTTAGGAC CTTTATGTAC 9780 
AGGGCCTCAA GATAACTGAT GTTAGTCAGA AGACTTTCCA TTCTGGCCAC AGTTCAGCTG 9840 
AGGCAATCCT GGAATTTTCT CTCCGCTGCA CAGTTGCAGT CATCCCAGTT TGTACAGTTC 9900 
TGGCACTTTT TGGGTCAGGC CGTGATCCAA GGAGCAGAAG TTCCAGCTAT GGTCAGGGAG 9960 
TGCCTGACCG TCCCAACTCA CTGCACTCAA ACAAAGGCGA AACCACAAGA GTGGCTTTTG 10020
TTGAAATTGC AGTGTGGCCC AGAGGGGCTG CACCAGTACT GGATTGACCA CGAGGCAACA 10080
TTAATCCTCA GCAAGTGCAA TTTGCAGCCA TTAAATTGAA CTAACTGATA CTACAATGCA 10140
ATCAGTATCA ACAAGTGGTT TGGCTTGGAA GATGGAGTCT AGGGGCTCTA CAGGAGTAGC 10200 
TACTCTCTAA TGGAGTTGCA TTTTGAAGCA GGACACTGTG AAAAGCTGGC CTCCTAAAGA 10260 
GGCTGCTAAA CATTAGGGTC AATTTTCCAG TGCACTTTCT GAAGTGTCTG CAGTTCCCCA 10320 
TGCAAAGCTG CCCAAACATA GCACTTCCAA TTGAATACAA TTATATGCAG GCGTACTGCT 10380 
TCTTGCCAGC ACTGTCCTTC TCAAATGAAC TCAACAAACA ATTTCAAAGT CTAGTAGAAA 10440
GTAACAAGCT TTGAATGTGA TTAAAAAGTA TATCTGCTTT CAGTAGTTCA GCTTATTTAT 10500 
GCCCACTAGA AACATCTTGT ACAAGCTGAA CACTGGGGCT CCAGATTAGT GGTAAAACCT 10560 
ACTTTATACA ATCATAGAAT CATAGAATGG CCTGGGTTGG AAGGGACCCC AAGGATCATG 10620 
AAGATCCAAC ACCCCCGCCA CAGGCAGGGC CACCAACCTC CAGATCTGGT ACTAGACCAG 10680 
GCAGCCCAGG GCTCCATCCA ACCTGGCCAT GAACACCTCC AGGGATGGAG CATCCACAAC 10740
CTCTCTGGGC AGCCTGTGCC AGCACCTCAC CACCCTCTCT GTGAAGAACT TTTCCCTGAC 10800
ATCCAATCTA AGCCTTCCCT CCTTGAGGTT AGATCCACTC CCCCTTGTGC TATCACTGTC 10860
TACTCTTGTA AAAAGTTGAT TCTCCTCCTT TTTGGAAGGT TGCAATGAGG TCTCCTTGCA 10920
GCCTTCTTCT CTTCTGCAGG ATGAACAAGC CCAGCTCCCT CAGCCTGTCT TTATAGGAGA 10980
GGTGCTCCAG CCCTCTGATC ATCTTTGTGG CCCTCCTCTG GACCCGCTCC AAGAGCTCCA 11040
CATCTTTCCT GTACTGGGGG CCCCAGGCCT GAATGCAGTA CTCCAGATGG GGCCTCAAAA 11100
GAGCAGAGTA AAGAGGGACA ATCACCTTCC TCACCCTGCT GGCCAGCCCT CTTCTGATGG 11160
AGCCCTGGAT ACAACTGGCT TTCTGAGCTG CAACTTCTCC TTATCAGTTC CACTATTAAA 11220
ACAGGAACAA TACAACAGGT GCTGATGGCC AGTGCAGAGT TTTTCACACT TCTTCATTTC 11280
GGTAGATCTT AGATGAGGAA CGTTGAAGTT GTGCTTCTGC GTGTGCTTCT TCCTCCTCAA 11340
ATACTCCTGC CTGATACCTC ACCCCACCTG CCACTGAATG GCTCCATGGC CCCCTGCAGC 11400
CAGGGCCCTG ATGAACCCGG CACTGCTTCA GATGCTGTTT AATAGCACAG TATGACCAAG 11460
TTGCACCTAT GAATACACAA ACAATGTGTT GCATCCTTCA GCACTTGAGA AGAAGAGCCA 11520
AATTTGCATT GTCAGGAAAT GGTTTAGTAA TTCTGCCAAT TAAAACTTGT TTATCTACCA 11580
TGGCTGTTTT TATGGCTGTT AGTAGTGGTA CACTGATGAT GAACAATGGC TATGCAGTAA 11640
AATCAAGACT GTAGATATTG CAACAGACTA TAAAATTCCT CTGTGGCTTA GCCAATGTGG 11700
TACTTCCCAC ATTGTATAAG AAATTTGGCA AGTTTAGAGC AATGTTTGAA GTGTTGGGAA 11760
ATTTCTGTAT ACTCAAGAGG GCGTTTTTGA CAACTGTAGA ACAGAGGAAT CAAAAGGGGG 11820
TGGGAGGAAG TTAAAAGAAG AGGCAGGTGC AAGAGAGCTT GCAGTCCCGC TGTGTGTACG 11880
ACACTGGCAA CATGAGGTCT TTGCTAATCT TGGTGCTTTG CTTCCTGCCC CTGGCTGCCT 11940
TAGGGTGCGA TCTGCCTCAG ACCCACAGCC TGGGCAGCAG GAGGACCCTG ATGCTGCTGG 12000
CTCAGATGAG GAGAATCAGC CTGTTTAGCT GCCTGAAGGA TAGGCACGAT TTTGGCTTTC 12060
CTCAAGAGGA GTTTGGCAAC CAGTTTCAGA AGGCTGAGAC CATCCCTGTG CTGCACGAGA 12120
TGATCCAGCA GATCTTTAAC CTGTTTAGCA CCAAGGATAG CAGCGCTGCT TGGGATGAGA 12180
CCCTGCTGGA TAAGTTTTAC ACCGAGCTGT ACCAGCAGCT GAACGATCTG GAGGCTTGCG 12240
TGATCCAGGG CGTGGGCGTG ACCGAGACCC CTCTGATGAA GGAGGATAGC ATCCTGGCTG 12300
TGAGGAAGTA CTTTCAGAGG ATCACCCTGT ACCTGAAGGA GAAGAAGTAC AGCCCCTGCG 12360
CTTGGGAAGT CGTGAGGGCT GAGATCATGA GGAGCTTTAG CCTGAGCACC AACCTGCAAG 12420
AGAGCTTGAG GTCTAAGGAG TAAAAAGTCT AGAGTCGGGG CGGCCGGCCG CTTCGAGCAG 12480
ACATGATAAG ATACATTGAT GAGTTTGGAC AAACCACAAC TAGAATGCAG TGAAAAAAAT 12540
GCTTTATTTG TGAAATTTGT GATGCTATTG CTTTATTTGT AACCATTATA AGCTGCAATA 12600
AACAAGTTAA CAACAACAAT TGCATTCATT TTATGTTTCA GGTTCAGGGG GAGGTGTGGG 12660
AGGTTTTTTA AAGCAAGTAA AACCTCTACA AATGTGGTAA AATCGATAAG GATCCGTCGA 12720
GCGGCCGC 12728

SEQ ID NO: 5

TGCGATCTGC CTCAGACCCA CAGCCTGGGC AGCAGGAGGA CCCTGATGCT GCTGGCTCAG 60
ATGAGGAGAA TCAGCCTGTT TAGCTGCCTG AAGGATAGGC ACGATTTTGG CTTTCCTCAA 120
GAGGAGTTTG GCAACCAGTT TCAGAAGGCT GAGACCATCC CTGTGCTGCA CGAGATGATC 180
CAGCAGATCT TTAACCTGTT TAGCACCAAG GATAGCAGCG CTGCTTGGGA TGAGACCCTG 240
CTGGATAAGT TTTACACCGA GCTGTACCAG CAGCTGAACG ATCTGGAGGC TTGCGTGATC 300
CAGGGCGTGG GCGTGACCGA GACCCCTCTG ATGAAGGAGG ATAGCATCCT GGCTGTGAGG 360
AAGTACTTTC AGAGGATCAC CCTGTACCTG AAGGAGAAGA AGTACAGCCC CTGCGCTTGG 420
GAAGTCGTGA GGGCTGAGAT CATGAGGAGC TTTAGCCTGA GCACCAACCT GCAAGAGAGC 480
TTGAGGTCTA AGGAGTAA 498
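
The sequence rows in this document print bases in blocks of ten, with a running base count at the end of each row. The following is a minimal sketch, assuming that layout, of how a row's stated count can be checked against the bases actually present. The function name `check_listing` and its `start` parameter are hypothetical helpers introduced for illustration; they are not part of the specification.

```python
import re

# Assumed row layout: whitespace-separated blocks of bases, then a running total.
ROW = re.compile(r"^([ACGTacgt\s]+?)\s+(\d+)\s*$")

def check_listing(rows, start=0):
    """Return (row_index, stated_total, counted_total) for each inconsistent row.

    A row that does not match the assumed layout is reported as
    (row_index, None, None).
    """
    problems = []
    total = start
    for i, row in enumerate(rows):
        m = ROW.match(row)
        if m is None:
            problems.append((i, None, None))
            continue
        bases = re.sub(r"\s+", "", m.group(1))
        stated = int(m.group(2))
        total += len(bases)
        if total != stated:
            problems.append((i, stated, total))
            total = stated  # resynchronise so later rows are judged on their own
    return problems

# The last two rows of SEQ ID NO: 5, which follow position 420.
rows = [
    "GAAGTCGTGA GGGCTGAGAT CATGAGGAGC TTTAGCCTGA GCACCAACCT GCAAGAGAGC 480",
    "TTGAGGTCTA AGGAGTAA 498",
]
print(check_listing(rows, start=420))
```

A row whose count has been split by a stray space, or that has lost a base, is reported rather than silently accepted, which makes it easier to locate transcription damage in a long listing.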
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.-rrrrTTCT ATTT c^TCTC TTCTTGCCGA TGAAAGGATA 60 
TGCCGCCTTC TTTGATATTC ACTCTGTTGT JTTiCATLi TTCTTCTGAT CAGTGTTTTT 120 
TAACAGTCTG TATAACAGTC JGTGAGGAAA ^TxGGTAT TxCT ^ 
ATAAGTAATG .TTGAATATTG GATAAGGCTG TG1G1CCTJ.1 ^ TTGCCTGTTT 240 

CAGCAGGTGG TGGTTGGGGT ^GGCAGCX CAGTGACAGG ^^ SSaSSa 300 
TTTTTTTTTT TTTTTTTTTT ^GTAAGGTG ««lTTWC ™ GAAACTGTAC 360 

ScSSSc JSSSS SSSSS* G C CTTTGGTT CTGATTGCAT 420 

AGCCCTTTTC TTiCAIl^u ^ TTTATTTATA GTGTGGCTTG AAAGCTTGGA 480 

TATGGAAAAC GTTGATCGGA JCTTGAGGTT ™ ^ GCTTGATGCT TTATTTTTTC 540 
TAGCTGTTGT TAGACGAGAT ACCTTATTAA GTTTAGO ^ CTGGTGm GCTTAGATTT 600 
CCTTTGAAGT AGTGAGCGTT CTCTGGTTTT TTTCClTTC. TAAATGTTTT 660 

TTCTAATGGG ATTTTTTACC ^ATGATCTA GTTGCATACC ^ atctgtgttt 720 

CCTAGTTAAC ATGTTGATAA ClTCGGATTT ALAl<-i±.^i a TTTCCTT TTTTTTTATC 780 

CTAGTAAAAA TATATGGCAT XTATAGA.AAT ACGTAATTCC JGATTTCCTT TTTTTT ^ 
TCTATGCTCT GTGTGTACAG GTCAAACAGA CTTCACTCCT £TTT ^ 
ATATGCAGTC TGTCGTTGGT TCTTGTGTTG TAACCAxAC TTTGGCTGCT 960 

ATGCTCAGTA AGGCGGGTTG ^ACATGGGT TCAAA GTA. CGCTTCAGAT 1020 

GCCTTCCCGA GUCCMOC JCTAAACTGC TTCTGCACTG AGGT ^ TTCTAAAATA 1080 
CCCAGGGAAG TGCAGATCCA CGTGCATATT £TTAA~Aft TTAAATTATT TTGGTAACGG 1140 
TTTTGGCATA GGA^GCAAGC ^CATGGATT ^TGCGAC ™ TATGCAGAAG 1200 

AGTGCATAGG TTTTAAACAC JGTiGCAGCA TGC£ ACCAG CATTGCAGAT 1260 

TGATGCCTGG ATGCCTGTTG CAGCTGTTTA CGGC CiCCC TCCCGGAACA 1320 

AGGGGTGGGG TGCTTTGTGT CGTGTTCCCA CMOj^GCCA JACA TTACTATGAA 1380 

CATCTCACCT GCTGGGTACT TTTCAAACCA. TCTi A ^!, J^TTCCCAT GTTGGGCAAA 1440 
ACAGAGAAGT TCCTCAGTTG OATATTCTCA TG^jCT TCCTTTCTAT 1500 

GTATGATAAA GCATCTCTAi ^GlAAATTA ^GCA^ TGTGTCTGTG CTTCAATGTT 1560 
AGCACCACTT ATTGCAGCAG GTGTAGGGTC TGG.CTGGCC TG QTAAACAGTA 1620 
TTAAAGCTTC TTTGGAAATA ^TGACTTG ATTGAAO ^g^ttcc GCCTATTCAT 1680 
CTTACCTTTG ATCCCAATGA AATCGAGCAT gMW GGAATATATT CAAGTAATAG 1740 
ACCATGTAAT GTAATTTTAC ACCCCCAGTG ^ACAC aAAATATTTT AAACTGTGCA 1800 
ACTTTGGCCT CACCCTCTTG TGTACTGTAT TTTG^ AT cSaatgxaag aaAATGAGGA 1860 
TATGATTATT ACATTATGAA AGAGACATTC TGCTCAlU TCC OTAAAAAAAA 1920 

GTGCGTGTGC TTTTATAAAT ACAAGTGATT GCAAA TAGT TC CTATTTGG 1980 

AAAAAAAAAG TAATATAAAA AGGACCAGGT ^TTACAAG ■ ^ ATAAGGCTQT 2040 

TAAACAGTTA CATTTTTATG AAGATTACCA GCGCiGCTC, , GTCTGGGTAA 2100 

ATTGTCTTCC. TGTACCATTG CATTJCCTCA. gCCCAATg S TCT GAGTTG GAATGCAGAG 2160 
ACTAXTCAAG AAATGGCTTT GAAATACAGC ATGGGAGCTT GT ^ggatttt 2220 

TTGCACTGCA AAATGTCAGG AAATGGATGT GTCTCAGAAT GCTGAATGTT 2280 

ATATGTGTAT ATAGTAAGCA GTTTCCTGAT TCCAGCAGGC CAAAGAGTCT ^ ^ 
GTGTTGCCGG AGACCTGTAT TTCTCAACAA GGTAAGATGG TATCC^GCA ^ 
TTAATACATT TTCAGCAGAA GaACTTAGTT AATCXCa v. CCTTTTTTTC 2460 

TTTTAGATGT TATACTTGAA ATACTGCATA JJ^JfJE IAtoTTCGTC TTAAACTGCA 2520 
AGCCTTTAGG AGACTGTTAA GCAATTTGCT GTCCAACTTT TGTGTTGGTC TT Q ^ 

ATAGTAGTTT ACCTTGTATT GAAGAAATAA JGACCATTTT ^TTAAAA 2640 
TCTGTCTTCA TTTTGACTTG TCTGATATCC ™»GTOCC ^TGTCA G 27Q0 
ATATTCAGAC ATCAAAACTT A^CGTGAGCT CAGTGGAGTT ACAGCl^W, 
GTTATTATTT CTGAAACTAG AAATGATGTT GTCTTCATCT GCTCATCAAA CACTTCATGC 2760
AGAGTGTAAG GCTAGTGAGA AATGCATACA TTTATTGATA CTTTTTTAAA GTCAACTTTT 2820
TATCAGATTT TTTTTTCATT TGGAAATATA TTGTTTTCTA GACTGCATAG CTTCTGAATC 2880
TGAAATGCAG TCTGATTGGC ATGAAGAAGC ACAGCACTCT TCATCTTACT TAAACTTCAT 2940
TTTGGAATGA AGGAAGTTAA GCAAGGGCAC AGGTCCATGA AATAGAGACA GTGCGCTCAG 3000
GAGAAAGTGA ACCTGGATTT CTTTGGCTAG TGTTCTAAAT CTGTAGTGAG GAAAGTAACA 3060
CCCGATTCCT TGAAAGGGCT CCAGCTTTAA TGCTTCCAAA TTGAAGGTGG CAGGCAACTT 3120 
GGCCACTGGT TATTTACTGC ATTATGTCTC AGTTTCGCAG CTAACCTGGC TTCTCCACTA 3180 
TTGAGCATGG ACTATAGCCT GGCTTCAGAG GCCAGGTGAA GGTTGGGATG GGTGGAAGGA 3240
GTGCTGGGCT GTGGCTGGGG GGACTGTGGG GACTCCAAGC TGAGCTTGGG GTGGGCAGCA 3300
CAGGGAAAAG TGTGGGTAAC TATTTTTAAG TACTGTGTTG CAAACGTCTC ATCTGCAAAT 3360
ACGTAGGGTG TGTACTCTCG AAGATTAACA GTGTGGGTTC AGTAATATAT GGATGAATTC 3420
ACAGTGGAAG CATTCAAGGG TAGATCATCT AACGACACCA GATCATCAAG CTATGATTGG 3480
AAGCGGTATC AGAAGAGCGA GGAAGGTAAG CAGTCTTCAT ATGTTTTCCC TCCACGTAAA 3540
GCAGTCTGGG AAAGTAGCAC CCCTTGAGCA GAGACAAGGA AATAATTCAG GAGCATGTGC 3600
TAGGAGAACT TTCTTGCTGA ATTCTACTTG CAAGAGCTTT GATGCCTGGC TTCTGGTGCC 3660
TTCTGCAGCA CCTGCAAGGC CCAGAGCCTG TGGTGAGCTG GAGGGAAAGA TTCTGCTCAA 3720 
GTCCAAGCTT CAGCAGGTCA TTGTCTTTGC TTCTTCCCCC AGCACTGTGC AGCAGAGTGG 3780 
AACTGATGTC GAAGCCTCCT GTCCACTACC TGTTGCTGCA GGCAGACTGC TCTCAGAAAA 3840
AGAGAGCTAA CTGTATGCCA TAGTCTGAAG GTAAAATGGG TTTTAAAAAA GAAAACACAA 3900
AGGCAAAACC GGCTGCCCCA TGAGAAGAAA GCAGTGGTAA ACATGGTAGA AAAGGTGCAG 3960
AAGCCCCCAG GCAGTGTGAC AGGCCCCTCC TGCCACCTAG AGGCGGGAAC AAGCTTCCCT 4020 
GCCTAGGGCT CTGCCCGCGA AGTGCGTGTT TCTTTGGTGG GTTTTGTTTG GCGTTTGGTT 4080 
TTGAGATTTA GACACAAGGG AAGCCTGAAA GGAGGTGTTG GGCACTATTT TGGTTTGTAA 4140 
AGCCTGTACT TCAAATATAT ATTTTGTGAG GGAGTGTAGC GAATTGGCCA ATTTAAAATA 4200 
AAGTTGCAAG AGATTGAAGG CTGAGTAGTT GAGAGGGTAA CACGTTTAAT GAGATCTTCT 4260
GAAACTACTG CTTCTAAACA CTTGTTTGAG TGGTGAGACC TTGGATAGGT GAGTGCTCTT 4320
GTTACATGTC TGATGCACTT GCTTGTCCTT TTCCATCCAC ATCCATGCAT TCCACATCCA 4380
CGCATTTGTC ACTTATCCCA TATCTGTCAT ATCTGACATA CCTGTCTCTT CGTCACTTGG 4440
TCAGAAGAAA CAGATGTGAT AATCCCCAGC CGCCCCAAGT TTGAGAAGAT GGCAGTTGCT 4500 
TCTTTCCCTT TTTCCTGCTA AGTAAGGATT TTCTCCTGGC TTTGACACCT CACGAAATAG 4560 
TCTTCCTGCC TTACATTCTG GGCATTATTT CAAATATCTT TGGAGTGCGC TGCTCTCAAG 4620 
TTTGTGTCTT CCTACTCTTA GAGTGAATGC TCTTAGAGTG AAAGAGAAGG AAGAGAAGAT 4680 
GTTGGCCGCA GTTCTCTGAT GAACACACCT CTGAATAATG GCCAAAGGTG GGTGGGTTTC 4740 
TCTGAGGAAC GGGCAGCGTT TGCCTCTGAA AGCAAGGAGC TCTGCGGAGT TGCAGTTATT 4800 
TTGCAACTGA TGGTGGAACT GGTGCTTAAA GCAGATTCCC TAGGTTCCCT GCTACTTCTT 4860
TTCCTTCTTG GCAGTCAGTT TATTTCTGAC AGACAAACAG CCACCCCCAC TGCAGGCTTA 4920
GAAAGTATGT GGCTCTGCCT GGGTGTGTTA CAGCTCTGCC CTGGTGAAAG GGGATTAAAA 4980
CGGGCACCAT TCATCCCAAA CAGGATCCTC ATTCATGGAT CAAGCTGTAA GGAACTTGGG 5040
CTCCAACCTC AAAACATTAA TTGGAGTACG AATGTAATTA AAACTGCATT CTCGCATTCC 5100
TAAGTCATTT AGTCTGGACT CTGCAGCATG TAGGTCGGCA GCTCCCACTT TCTCAAAGAC 5160
CACTGATGGA GGAGTAGTAA AAATGGAGAC CGATTCAGAA CAACCAACGG AGTGTTGCCG 5220
AAGAAACTGA TGGAAATAAT GCATGAATTG TGTGGTGGAC ATTTTTTTTA AATACATAAA 5280
CTACTTCAAA TGAGGTCGGA GAAGGTCAGT GTTTTATTAG CAGCCATAAA ACCAGGTGAG 5340 
CGAGTA^CAT TOTCTCTAC AAGAAAAACG ATTCTGAGCT CTGCGTAAGT . ATAAGTTCTC 5400 
CATAGCGGCT GAAGCTCCCC CCTGGCTGCC TGCCATCTCA GCTGGAGTGC AGTGCCATTT 5460
CCTTGGGGTT TCTCTCACAG CAGTAATGGG ACAATACTTC ACAAAAATTC TTTCTTTTCC 5520
TGTCATGTGG GATCCCTACT GTGCCCTCCT GGTTTTACGT TACCCCCTGA CTGTTCCATT 5580
CAGCGGTTTG GAAAGAGAAA AAGAATTTGG AAATAAAACA TGTCTACGTT ATCACCTCCT 5640
CCAGCATTTT GGTTTTTAAT TATGTCAATA ACTGGCTTAG ATTTGGAAAT GAGAGGGGGT 5700
TGGGTGTATT ACCGAGGAAC AAAGGAAGGC TTATATAAAC TCAAGTCTTT TATTTAGAGA 5760
ACTGGCAAGC TGTCAAAAAC AAAAAGGCCT TACCACCAAA TTAAGTGAAT AGCCGCTATA 5820
GCCAGCAGGG CCAGCACGAG GGATGGTGCA CTGCTGGCAC TATGCCACGG CCTGCTTGTG 5880
ACTCTGAGAG CAACTGCTTT GGAAATGACA GCACTTGGTG CAATTTCCTT TGTTTCAGAA 5940
TGCGTAGAGC GTGTGCTTGG CGACAGTTTT TCTAGTTAGG CCACTTCTTT TTTCCTTCTC 6000
TCCTCATTCT CCTAAGCATG TCTCCATGCT GGTAATCCCA GTCAAGTGAA CGTTCAAACA 6060
ATGAATCCAT CACTGTAGGA TTCTCGTGGT GATCAAATCT TTGTGTGAGG TCTATAAAAT 6120
ATGGAAGCTT ATTTATTTTT CGTTCTTCCA TATCAGTCTT CTCTATGACA ATTCACATCC 6180
ACCACAGCAA ATTAAAGGTG AAGGAGGCTG GTGGGATGAA GAGGGTCTTC TAGCTTTACG 6240
TTCTTCCTTG CAAGGCCACA GGAAAATGCT GAGAGCTGTA GAATACAGCC TGGGGTAAGA 6300
AGTTCAGTCT CCTGCTGGGA CAGCTAACCG CATCTTATAA CCCCTTCTGA GACTCATCTT 6360
AGGACCAAAT AGGGTCTATC TGGGGTTTTT GTTCCTGCTG TTCCTCCTGG AAGGCTATCT 6420
CACTATTTCA CTGCTCCCAC GGTTACAAAC CAAAGATACA GCCTGAATTT TTTCTAGGCC 6480
ACATTACATA AATTTGACCT GGTACCAATA TTGTTCTCTA TATAGTTATT TCCTTCCCCA 6540
CTGTGTTTAA CCCCTTAAGG CATTCAGAAC AACTAGAATC ATAGAATGGT TTGGATTGGA 6600 
AGGGGCCTTA AACATCATCC ATTTCCAACC CTCTGCCATG GGCTGCTTGC CACCCACTGG 6660 
CTCAGGCTGC CCAGGGCCCC ATCCAGCCTG GCCTTGAGCA CCTCCAGGGA TGGGGCACCC 6720 
ACAGCTTCTC TGGGCAGCCT GTGCCAACAC CTCACCACTC TCTGGGTAAA GAATTCTCTT 6780 
TTAACATCTA ATCTAAATCT CTTCTCTTTT AGTTTAAAGC CATTCCTCTT TTTCCCGTTG 6840 
CTATCTGTCC AAGAAATGTG TATTGGTCTC CCTCCTGCTT ATAAGCAGGA AGTACTGGAA 6900 
GGCTGCAGTG AGGTCTCCCC ACAGCCTTCT CTTCTCCAGG CTGAACAAGC CCAGCTCCTT 6960
CAGCCTGTCT TCGTAGGAGA TCATCTTAGT GGCCCTCCTC TGGACCCATT CCAACAGTTC 7020
CACGGCTTTC TTGTGGAGCC CCAGGTCTGG ATGCAGTACT TCAGATGGGG CCTTACAAAG 7080
GCAGAGCAGA TGGGGACAAT CGCTTACCCC TCCCTGCTGG CTGCCCCTGT TTTGATGCAG 7140
CCCAGGGTAC TGTTGGCCTT TCAGGCTCCC AGACCCCTTG CTGATTTGTG TCAAGCTTTT 7200
CATCCACCAG AACCCACGCT TCCTGGTTAA TACTTCTGCC CTCACTTCTG TAAGCTTGTT 7260
TCAGGAGACT TCCATTCTTT AGGACAGACT GTGTTACACC TACCTGCCCT ATTCTTGCAT 7320
ATATACATTT CAGTTCATGT TTCCTGTAAC AGGACAGAAT ATGTATTCCT CTAACAAAAA 7380
TACATGCAGA ATTCCTAGTG CCATCTCAGT AGGGTTTTCA TGGCAGTATT AGCACATAGT 7440
CAATTTGCTG CAAGTACCTT CCAAGCTGCG GCCTCCCATA AATCCTGTAT TTGGGATCAG 7500
TTACCTTTTG GGGTAAGCTT TTGTATCTGC AGAGACCCTG GGGGTTCTGA TGTGCTTCAG 7560
CTCTGCTCTG TTCTGACTGC ACCATTTTCT AGATCACCCA GTTGTTCCTG TACAACTTCC 7620
TTGTCCTCCA TCCTTTCCCA GCTTGTATCT TTGACAAATA CAGGCCTATT TTTGTGTTTG 7680
CTTCAGCAGC CATTTAATTC TTCAGTGTCA TCTTGTTCTG TTGATGCCAC TGGAACAGGA 7740
TTTTCAGCAG TCTTGCAAAG AACATCTAGC TGAAAACTTT CTGCCATTCA ATATTCTTAC 7800
CAGTTCTTCT TGTTTGAGGT GAGCCATAAA TTACTAGAAC TTCGTCACTG ACAAGTTTAT 7860
GCATTTTATT ACTTCTATTA TGTACTTACT TTGACATAAC ACAGACACGC ACATATTTTG 7920 
CTGGGATTTC CACAGTGTCT CTGTGTCCTT CACATGGTTT TACTGTCATA CTTCCGTTAT 7980 
AACCTTGGCA ATCTGCCCAG CTGCCCATCA CAAGAAAAGA GATTCCTTTT TTATTACTTC 8040 
TCTTCAGCCA ATAAACAAAA TGTGAGAAGC CCAAACAAGA ACTTGTGGGG CAGGCTGCCA 8100
TCAAGGGAGA GACAGCTGAA GGGTTGTGTA GCTCAATAGA ATTAAGAAAT AATAAAGCTG 8160
TGTCAGACAG TTTTGCCTGA TTTATACAGG CACGCCCCAA GCCAGAGAGG CTGTCTGCCA 8220
AGGCCACCTT GCAGTCCTTG GTTTGTAAGA TAAGTCATAG GTAACTTTTC TGGTGAATTG 8280
CGTGGAGAAT CATGATGGCA GTTCTTGCTG TTTACTATGG TAAGATGCTA AAATAGGAGA 8340
CAGCAAAGTA ACACTTGCTG CTGTAGGTGC TCTGCTATCC AGACAGCGAT GGCACTCGCA 8400
CACCAAGATG AGGGATGCTC CCAGCTGACG GATGCTGGGG CAGTAACAGT GGGTCCCATG 8460
CTGCCTGCTC ATTAGCATCA CCTCAGCCCT CACCAGCCCA TCAGAAGGAT CATCCCAAGC 8520
TGAGGAAAGT TGCTCATCTT CTTCACATCA TCAAACCTTT GGCCTGACTG ATGCCTCCCG 8580
GATGCTTAAA TGTGGTCACT GACATCTTTA TTTTTCTATG ATTTCAAGTC AGAACCTCCG 8640
GATCAGGAGG GAACACATAG TGGGAATGTA CCCTCAGCTC CAAGGCCAGA TCTTCCTTCA 8700
ATGATCATGC ATGCTACTTA GGAAGGTGTG TGTGTGTGAA TGTAGAATTG CCTTTGTTAT 8760
TTTTTCTTCC TGCTGTCAGG AACATTTTGA ATACCAGAGA AAAAGAAAAG TGCTCTTCTT 8820
GGCATGGGAG GAGTTGTCAC ACTTGCAAAA TAAAGGATGC AGTCCCAAAT GTTCATAATC 8880
TCAGGGTCTG AAGGAGGATC AGAAACTGTG TATACAATTT CAGGCTTCTC TGAATGCAGC 8940
TTTTGAAAGC TGTTCCTGGC CGAGGCAGTA CTAGTCAGAA CCCTCGGAAA CAGGAACAAA 9000
TGTCTTCAAG GTGCAGCAGG AGGAAACACC TTGCCCATCA TGAAAGTGAA TAACCACTGC 9060
CGCTGAAGGA ATCCAGCTCC TGTTTGAGCA GGTGCTGCAC ACTCCCACAC TGAAACAACA 9120
GTTCATTTTT ATAGGACTTC CAGGAAGGAT CTTCTTCTTA AGCTTCTTAA TTATGGTACA 9180
TCTCCAGTTG GCAGATGACT ATGACTACTG ACAGGAGAAT GAGGAACTAG CTGGGAATAT 9240
TTCTGTTTGA CCACCATGGA GTCACCCATT TCTTTACTGG TATTTGGAAA TAATAATTCT 9300
GAATTGCAAA GCAGGAGTTA GCGAAGATCT TCATTTCTTC CATGTTGGTG ACAGCACAGT 9360
TCTGGCTATG AAAGTCTGCT TACAAGGAAG AGGATAAAAA TCATAGGGAT AATAAATCTA 9420
AGTTTGAAGA CAATGAGGTT TTAGCTGCAT TTGACATGAA GAAATTGAGA CCTCTACTGG 9480
ATAGCTATGG TATTTACGTG TCTTTTTGCT TAGTTACTTA TTGACCCCAG CTGAGGTCAA 9540
GTATGAACTC AGGTCTCTCG GGCTACTGGC ATGGATTGAT TACATACAAC TGTAATTTTA 9600
GCAGTGATTT AGGGTTTATG AGTACTTTTG CAGTAAATCA TAGGGTTAGT AATGTTAATC 9660
TCAGGGAAAA AAAAAAAAAG CCAACCCTGA CAGACATCCC AGCTCAGGTG GAAATCAAGG 9720
ATCACAGCTC AGTGCGGTCC CAGAGAACAC AGGGACTCTT CTCTTAGGAC CTTTATGTAC 9780
AGGGCCTCAA GATAACTGAT GTTAGTCAGA AGACTTTCCA TTCTGGCCAC AGTTCAGCTG 9840
AGGCAATCCT GGAATTTTCT CTCCGCTGCA CAGTTCCAGT CATCCCAGTT TGTACAGTTC 9900
TGGCACTTTT TGGGTCAGGC CGTGATCCAA GGAGCAGAAG TTCCAGCTAT GGTCAGGGAG 9960
TGCCTGACCG TCCCAACTCA CTGCACTCAA ACAAAGGCGA AACCACAAGA GTGGCTTTTG 10020
TTGAAATTGC AGTGTGGCCC AGAGGGGCTG CACCAGTACT GGATTGACCA GGAGGCAACA 10080
TTAATCCTCA GCAAGTGCAA TTTGCAGCCA TTAAATTGAA CTAACTGATA CTACAATGCA 10140
ATCAGTATCA ACAAGTGGTT TGGCTTGGAA GATGGAGTCT AGGGGCTCTA CAGGAGTAGC 10200
TACTCTCTAA TGGAGTTGCA TTTTGAAGCA GGACACTGTG AAAAGCTGGC CTCCTAAAGA 10260
GGCTGCTAAA CATTAGGGTC AATTTTCCAG TGCACTTTCT GAAGTGTCTG CAGTTCCCCA 10320
TGCAAAGCTG CCCAAACATA GCACTTCCAA TTGAATACAA TTATATGCAG GCGTACTGCT 10380
TCTTGCCAGC ACTGTCCTTC TCAAATGAAC TCAACAAACA ATTTCAAAGT CTAGTAGAAA 10440
GTAACAAGCT TTGAATGTCA TTAAAAAGTA TATCTGCTTT CAGTAGTTCA GCTTATTTAT 10500
GCCCACTAGA AACATCTTGT ACAAGCTGAA CACTGGGGCT CCAGATTAGT GGTAAAACCT 10560
ACTTTATACA ATCATAGAAT CATAGAATGG CCTGGGTTGG AAGGGACCCC AAGGATCATG 10620
AAGATCCAAC ACCCCCGCCA CAGGCAGGGC CACCAACCTC CAGATCTGGT ACTAGACCAG 10680
GCAGCCCAGG GCTCCATCCA ACCTGGCCAT GAACACCTCC AGGGATGGAG CATCCACAAC 10740
CTCTCTGGGC AGCCTGTGCC AGCACCTCAC CACCCTCTCT GTGAAGAACT TTTCCCTGAC 10800
ATCCAATCTA AGCCTTCCCT CCTTGAGGTT AGATCCACTC CCCCTTGTGC TATCACTGTC 10860
TACTCTTGTA AAAAGTTGAT TCTCCTCCTT TTTGGAAGGT TGCAATGAGG TCTCCTTGCA 10920
GCCTTCTTCT CTTCTGCAGG ATGAACAAGC CCAGCTCCCT CAGCCTGTCT TTATAGGAGA 10980 
GGTGCTCCAG CCCTCTGATC ATCTTTGTGG CCCTCCTCTG GACCCGCTCC AAGAGCTCCA 11040 
CATCTTTCCT GTACTGGGGG CCCCAGGCCT GAATGCAGTA CTCCAGATGG GGCCTCAAAA 11100
GAGCAGAGTA AAGAGGGACA ATCACCTTCC TCACCCTGCT GGCCAGCCCT CTTCTGATGG 11160
AGCCCTGGAT ACAACTGGCT TTCTGAGCTG CAACTTCTCC TTATCAGTTC CACTATTAAA 11220
ACAGGAACAA TACAACAGGT GCTGATGGCC AGTGCAGAGT TTTTCACACT TCTTCATTTC 11280
GGTAGATCTT AGATGAGGAA CGTTGAAGTT GTGCTTCTGC GTGTGCTTCT TCCTCCTCAA 11340
ATACTCCTGC CTGATACCTC ACCCCACCTG CCACTGAATG GCTCCATGGC CCCCTGCAGC 11400
CAGGGCCCTG ATGAACCCGG CACTGCTTCA GATGCTGTTT AATAGCACAG TATGACCAAG 11460
TTGCACCTAT GAATACACAA ACAATGTGTT GCATCCTTCA GCACTTGAGA AGAAGAGCCA 11520
AATTTGCATT GTCAGGAAAT GGTTTAGTAA TTCTGCCAAT TAAAACTTGT TTATCTACCA 11580
TGGCTGTTTT TATGGCTGTT AGTAGTGGTA CACTGATGAT GAACAATGGC TATGCAGTAA 11640
AATCAAGACT GTAGATATTG CAACAGACTA TAAAATTCCT CTGTGGCTTA GCCAATGTGG 11700 
TACTTCCCAC ATTGTATAAG AAATTTGGCA AGTTTAGAGC AATGTTTGAA GTGTTGGGAA 11760 
ATTTCTGTAT ACTCAAGAGG GCGTTTTTGA CAACTGTAGA ACAGAGGAAT CAAAAGGGGG 11820 
TGGGAGGAAG TTAAAAGAAG AGGCAGGTGC AAGAGAGCTT GCAGTCCCGC TGTGTGTACG 11880 
ACACTGGCAA CATGAGGTCT TTGCTAATCT TGGTGCTTTG CTTCCTGCCC CTGGCTGCCT 11940 
TAGGG 11945

AAAGTCTAGAGTCGGGGCGGCCGGCCGCTTCGAGCAGACATGATAAGATACATTGATGAG 60
TTTGGACAAACCACAACTAGAATGCAGTGAAAAAAATGCTTTATTTGTGAAATTTGTGAT 120
GCTATTGCTTTATTTGTAACCATTATAAGCTGCAATAAACAAGTTAACAACAACAATTGC 180
ATTCATTTTATGTTTCAGGTTCAGGGGGAGGTGTGGGAGGTTTTTTAAAGCAAGTAAAAC 240
CTCTACAAATGTGGTAAAATCGATAAGGATCCGTCGAGCGGCCGC 285
1 CGCGTGGTAGGTGGCGGGGGGTTCCCAGGAGAGCCCCCAGCGCGGACGGC
AGCGCCGTCACTCACCGCTCCGTCTCCCTCCGCCCAGGGTCGCCTGGCGC 
AACCGCTGCAAGGGCACCGACGTCCAGGCGTGGATCAGAGGCTGCCGGCT 
GTG*K1GAGCTGCCGCGCCCGGCCCGCCCGCTGCACAGCCC-GCCGCTTTGC 
200 GAGCGCGACGCTACCCGCTTGGCAGTTTTAAACGCATCCCTCATTAAAAC 
GACTATACGCAAACGCCTTCCCGTCGGTCCGCGTCTCTTTCCGCCGCCAG 
GGCGACACTCGCGGGGAGGGCGGGAAGGGGGCCGGGCGGGAGCCCGCGGC 
CAACCGTCGCCCCGTGACGGCACCGCCCCGCCCCCGTGACGCGGTGCGGG 
400 CGCCGGGGCCGTGGGGCTGAGCGCTGCGGCGGGGCCGG3CCGGGCCGGGG 
CGGGAGCTGAGCGCGGCGCGGCTGCGGGCGGCGCCCCCTCCGGTGCAATA 
TGTTCAAGAGAATGGCTGAGTTCGGGCCTGACTCCGGGGGCAGGGTGAAG 
GTGCGGCGCGGGCGGAGGGACGGGGCGGGCGCGGGGCCGCCCGGCGGGTG 
600 CCGGGGCCTCTGCCGGCCCGCCCGGCTCGGGCTGCTC-CGGCGCTTACGGG
CGCGCTTCTCGCCGCTGCCGCTTCTCTTCTCTCCCGCGCAAGGGCGTCAC 
CATCGTGAAGCCGGTAGTGTACGGGAACGTGGCGCGGTACTTCGGGAAGA 
AGAGGGAGGAGGACGGGCACACGCATCAGTGGACGGTTTACGTGAAGCCC 
800 TACAGGAACGAGGTAGGGCCCGAGCGCGTCGGCCGCCGTTCTCGGAGCGC 
CGGAGCCGTCAGCGCCGCGCCTGGGTGCGCTGTGGGACACAGCGAGCTTC 
TCTCGTAGGACATGTCCGCCTACGTGAAAAAAATCCAGTTCAAGCTGCAC 
GAGAGCTACGGGAATCCTCTCCGAGGTGGGTGTTGCGTCGGGGGGTTTGC 
1000 TCCGCTCGGTCCCGCTGAGGCTCGTCGCCCTCATCTT7CTTTCGTGCCGC 
AGTCGTTACCAAACCGCCGTACGAGATCACCGAAACGC-GCTGGGGCGAAT 
TTGAAATCATCATCAAGATATTTTTCATTGATCCAAACGAGCGACCCGTA 
AGTACGCTCAGCTTCTCG7AGTGCTTCCCCCGTCCTGGCGGCCCGGGGCT 
1200 GGGCTGCTCGCTGCTGCCGGTCACAGTCCCGCCAGGCGCGGAGCTGACTG 
AGCTCCCTTTCCCGGGACGTGTGCTCTGTGTTCGGTCAGCGAGGCTATCG 
GGAGGGCTTTGGCTGCATTTGGCTTCTCTGGCGCTTAGCGCAGGAGCACG 
TTGTGCTACGCCTGAACTACAGCTGTGAGAAGGCCGTGGAAACCGCTCTC 
1400 AAACTGATTTATTGGCGAAATGGCTCTAAACTAAATCC-TCTCCTCTCTTT 
GGAAATGCTTTAGAGAAGGTCTCTGTGGTAGTTCTTATGCATCTATCCTA 
AAGCACTTGGCCAGACAATTTAAAGACATCAAGCAGC-.TTTATAGCAGGC 
ACGTTTAATAACGAATACTGAATTTAAGTAACtCTGCTCACGTTGTATGA 
1600 CGTTTAT^CGTATTCCTGAAAGCCATTAAAATCCTC-rGCAGTTGTTTA 
GTAAGAACAGCTGCCACTGTTTTGTATCTAGGAGATA.-.CTGGTGTTTCCC 
TACAGTTCTCAAGCTGATAAAACTCTGTCTTTGTATCTAGGTAACCCTGT 
ATCACTTGCTGAAGCTTrTTCAGTCTGACACCAATGCAATCCTGGGAAAG 
1800 AAAACTGTAGTTTCTGAAT7CTATGATGAAATGGTATGAAAATTTTAATG 
TCAACCGAGCCTGACTTTATTTAAAAAAAATTATTGATGGTGCTGTGTAT 
TTTGGTCCTTCCTTAGATATTTCAAGATCCTACTGCCATGATGCAGCAAC 
TGCTAACGACGTCCCGTCAGCTGACACTTGGTGCTTACAAGCATGAAACA 
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2000 GAGTGTAAGTGCAAAATGAGGATACCTTCGCCGACCGTCATTCACTACTA 
ATG-TTTCTGTGGGATGTGATCGTACAGTGAGTTTGGCTGTGTGAAATTT 
GAATAGCTTGGTATTGGCAGTGATGACGTGATCGATGCCTTGCTTATCAT 
GTTTGAAATGAAGTAGAATAAATGCAGCCTGGTTTATTTGAGATAGTTTG 

2200 GTTCATTTTATGGAATGCAAGCAAAGATTATACTTCCTCACTGAATTGCA 
CTG^CCAAAGGTGTGAAATGTGTGGGGATCTGGAGGACCGTGACCGAGGG 
ACATTGGATCGCTATCTCCCATTTCTTTTGCTGTTACCAGTTCAGATTTT 
CTTT^CACCTAGTCTTTAATTCCCAGGGTTTTGTTTTTTCCTTGGTCATA 

2400 GTTTTTGTTTTTCACTCTGGCAAATGATGTTGTGAATTACACTGCTTCAG 
CCACAAAACTGATGGACTGAATGAGGTCATCAAACAAACTTTTCTTCTTC 
CGTATTTCCTTTTTTTTCCCCCACTTATCATTTTTACTGCTGTTGTTGAG 
T CTG^ AAGGCTAAAAGT AACTGTTTTGTGCTTTTTCAGGACGTGTGCTTT 
2600 CCAAATTACTGCCACATATATAAAGAAAGGTTGGAATTTTAA^GATAATT
CATGTTTCTTCTTCTTTTTTGCCACCACAGTTGCAGATCTTGAAGTAAAA 
ACCAGGG^AAAGCTGGAAGCTGCCAAAAAGAAAACCAGTTTTGAAATTGC 
TGAG r TTAAAGAAAGGTTAAAAGCAAGTCGTGAAACCATCAA.CTGCTTAA 

2800 AGAGTGAAATCAGAAAACTCGAAGAGGATGATCAGTCTAAAGATATGTGA 
TGAGTGTTGACTTGGCAGGGAGCCTATAATGAGAATGAAAGGACTTCAGT 
CGTGGAGTTGTATGCGTTCTCTCCAATTCTGTAACGGAGACTGTATGAAT 
TTC aT TTGCAAATCACTGCAGTGTGTGACAACTGACTTTTTATAAATGGC 

3000 AGA.-*^ ACAAGAATGAATGTATCCTCATTTTATAGTTAAAATCTATGGGTA 
TGTACTGGTTTATTTCAAGGAGAATGGATCGTAGAGACTTGGAGGCCAGA 
TTGC^GCTTGTATTGACTGCATTTGAGTGGTGTAGGAACATTTTGTCTAT 
GGTCCCGTGTTAGTTTACAGAATGCCACTGTTCACTGTTTTGTTTTGTAT 

3200 TTTACTTTTTCTACTGCAACGTCAAGGTTTTAAAAGTTGAAAATAAAACA 
TGCAGGTTTTTTTTAAATATTTTTTTGTCTCTATCCAGTTTGGGCTTCAA 
GTATTATTGTTAACAGCAAGTCCTGATTTAAGTCAGAGGCTGAAGTGTAA 
TGGTATTCA^GATGCTTAAGTCTGTTGTCAGCAAAACAAAAGAGAAAACT 

3400 TCATAAAATCAGGAAGTTGGCATTTCTAATAACTTCTTTATCAACAGATA
AGAGTTTCT=lGCCCTGCATCTACTTTCACTTATGTAGTTGATGCCTTTAT 
ATTTTGTGTGTTTGGATGCAGGAAGTGATTCCTACTCTGTTATGTAGATA 
TTCTATTTAACACTTGTACTCTGCTGTGCTTAGCCTTTCCCCATGAAAAT 
3600 TCAGCGGCTGTAAATCCCCCTCTTCTTTTGTAGCCTCATACAGATGGCAG
ACCC^CAGGCTTATAAAGGCTTGGGCATCTTCTTTACTGCTTTGAGATTC 
TGTG^TGCAGTAACCTCTGCCAGAGAGGAGAAAAGCCCCACAAACCTCAT 
CCCC r TCTTCTATAGCAATCAGTATTACTAATGCTTTGAGAA.CAGAGCAC 

3800 TGGT^G AAACGTTTGATAATTAGCATTTAACATGGCTTGGTAAAGATGC 
AGAACTGAAACAGCTGTGACAGTATGAAGTCAGTATGGAGACTTCATTAA
GACAAACAGCTGTTAAAATCAGGCATGTTTCATTGAGGAGGACGGGGCAA 
CTTGCACCAGTGGTGCCCACACAAATCCTTCCTGGCGCTGCAGACCAATT 
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a 0 0 0 TTTCTGGCATTCTGACTGCCGTTGCTGCTGGTCACAGAGAGCAACTATTT 
TTATCAGCCACAGGCAATTTGCTTGTAGTATTTTCCAAGTGTTGTAGGTA 
AGTATAAATGCATCGGCTCCAGAGCACTTTGAGTATACTTATTAAAAACA 
TAAATGAASGACAAATTAGCTTTGCTTGGGTGCACAGAACATTTTTAGTT 
4200 CCAGCCTGCTTTTTGGTAGAAGCCCTCTTCTGAGGCTAGAACTGACTTTG
ACAAGTAGAGAAACTGGCAACGGAGCTATTGCTATCGAAGGATCCTTGTT 
AACAAAGTTAATCGTCTTTTAAGGTTTGGTTTATTCATTAAATTTGCTTT 
TAAGCTGTAGCTGAAAAAG.AACGTGCTGTCTTCCATGCACCAGGTGGCAG 
4400 CTCTGTGCAAAGTGCTCTCTGGTCTCACCAGCCTTTTAATTGCCGGGATT
CTGGCACGTCTGAGAGGGCTCAGACTGGCTTCGTTTGTTTGAACAGCGTG
TACTGCTTTCTGTAGACATGGCCGGTTTCTCTCCTGCAGCTTATGAAACT 
GTTCACACTGAACACACTGGA^CAGGTTGCCCAAGGAGGCCGTGGATGCC 
4600 CCATCCCTGGAGGCATTCAAGGCCAGGCTGGATGTGGCTCTGGGCAGCCT 
GGTCTGGTGGTTGGCGATCCTGCAGATAGCAGCGGGGTTGAAACTCGATG 
ATCACTGTGGTCCTTTTCAACCCAGGCTATTCTATGATTCTATGATTCAA 
CAGCAAATCATATGTACTGAGAGAGGAAACAAACACAAGTGCTACTGTTT 
4800 GCAAGTTTTGTTCATTTGGTAAAAGAGTCAGGTTTTAAAATTCAAAATCT
GTCTGGTTTTGGTGTTTTTTTTTTTTTATTTATTATTTCTTTGGGGTTCT 
TTTTGATGCTTTATCTTTCTCTGCCAGGACTGTGTGACAATGGGAACGAA. 
AAAGAACATGCCAGGCACTGTCCTGGATTGCACACGCTGGTTGCACTCAG 
5000 TAGCAGGCTCAGAACTGCCAGTCTTTCCACAGTATTACTTTCTA^ACCTA 
ATTTTAATAGCGTTAGTAGACTTCCATCACTGGGCAGTGCTTAGTGAATG 
CTCTGTGTGAACGTTTTACTTATAAGCATGTTGGAAGTTTTGATGTTCCT 
GGATGCAGTAGGGA^GGACAGATTAGCTATGTGAAAAGTAGATTCTGAGT 
5200 ATCGGGGTTACAAAAA.GTATAGAAACGATGAGAAATTCTTGTTGTAACTA 
ATTGGAATTTCTTTA^GCGTTCACTTATGCTACATTCATAGTATTTCCAT 
TTAAAAGTAGGAAAAGGTAAAACGTGAAATCGTGTGATTTTCGGATGGAA 
CACCGCCTTCCTATGCACCTGACCAACTTCCAGAGGAAAAGCCTATTGAA 
5400 AGCCGAGATTAAGCCACCAAAAGAA.CTCATTTGCATTGGAATATGTAGTA 
TTTGCCCTCTTCCTCCCGGGTAATTACTATACTTTATAGGGTGCTTATAT 
GTTAAATGAGTGGCTGGCACTTTTTATTCTCACAGCTGTGGGGAATTCTG 
TCCTCTAGGACAGAAACAATTTTAATCTGTTCCACTGGTGACTGCTTTGT 
5600 CAGCACTTCCACCTGAAGAGATCAATACACTCTTCAATGTCTAGTTCTGC
AACACTTGGCAAACCTCACATCTTATTTCATACTCTCTTCATGCCTATGC 

^aSaaagcaataatctgggtaatttttgttttaatcactgtcctgacc 

SSGtrG^GTGTCCCACCTAAAGCTCAATTCAGGTCCTG^TCTCT 
.900 TC^CTCTCTATAGCTAACATGAAGAATCTTCAAAAGTTAGGTCTGAGGG 
ACTTAAGGCTAACTGT AGATGTTGTTGC CTGGTTTCTGTGCTGAAGGCCG 
TGTAGTAGTTAGAGCATTC^ACCTCTAG 
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SEQ ID NO: 10 

, tgccGCCTTCTTTGATATTCACTCTGTTGTATTTCATCTCTTCTTGCCC^ 

?gaSSSat^ 

to^Sgatcagtgtttttataagtaatgttgaatattggataaggctg 

„1 ^^^^^^^ ^^^^^'^^^^^^^^cTCTT^^TTTT^T 
rrTGGCAGCTCAGTGACAGGAGAGGTTTTTTTGCCTGTTTi iTTTTTi - 1 

TTTTTTTTTTAAGTAAGGTGTTCTTTTTTCTTAGTAAATTTTCTACTGGA 

3 oi cStatgttttgacaggtcagaaacatttcttcaaa^g.^ccttttg 
™Sgtacagcccttttctttcattccctttttgctttctgtgcc.^t 

A 51 ?™?SIgtgtggcttgaaagcttggatagctgttgtt A cacgagat 
ISSIt^ttaggccagcttgatgctttattttttcc^ 



arTGAGCGTTCTCTGGTTTTTTTCCTTTGAAACTGGTGAGC-CTTAGAT i i 

sol SSaISSSS^^ 



tatacttgtcatctgtgtttctagtaaaaatatatggcatttatagaaat 

751 A^GTAATTCCTGATTTCCTTTTTTTTTATCTCTATGCTCTGTGTGTACAG 



gtc1aacagacttcactcctatttttatttatagaattttatatg^.gtc 

TGTCGTTG^TTCTTGTGTTGTAAGGATACAGCCTTAAATTTCCTAGAGCG 
TTTGGCTGCTGCCTTCCCGAGATCCAGGACACTAAACTGCT i CTGCAC iG 

aSaaatcgc^cagatcccagggaagtgcagatccacgtgcata^ 

Jgca^mS^acttaaattattttggtaacggagtgcatagg 
tt^aaacacagSgcagcatgctaacgagtcacagcgtttatgcagaag 
i,m SSg^^tgcctgttgcagctgtttacggcactgccttgcagtgag 
ca^cgScttaggg^^ggggtgctttgtgtcgtgttcccacacgctgcca 



1351 



1501 aScaccac^aWgcaggtgtaggctct^ 

^ATCTTTTAAAGCTTCTTTGGAAATACACTGACTTGATTGAAGiCx 
VAGATAGTAAACAGI 
1651 SS^^^TATTCAAGT^C^CT 



r TTPAATCTTTT AAAGCTTC 

"AAGATAGtAAACAGTACTTACCTTTGATCCCAATGAAATCGAGCAT 

lOTTO^AAAAGAATTdCGCCTATTCATACCATGTAATGTAATTTTAC 



^acCCTCTTGTO^^CTGTATTTTGTAATAGAAAATATTTTAAACTGTGCA 
aAA^^G^^^^GTGCGTGTGCTTTTATAAATACAAGTGATTGCAAATTAGT 

S^S^taaaaaaaaaaaw 

^SIc^Sgaaa^cattcct^ 
aaga^a^^mcgctgctgactttctaaacataaggctgtattgtckcc 

^tacS^^tttcctcattgccaatttgcacaaggatgtctgggwa 
2 ioi actattcaag^aatggctttgaaatacagcatgggagcttgtctgag 

^naT^AGAGTTCCACTGCAAAATGTCAGGAAATGGATGTCTCTCAGAAT 
GC^CAACTCCAAAGGA^TTATATGTGTATATAGTAAGCAGTTTCCTGAT 
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22 51 TCCAGCAGGCCAAAGAGTCTGCTGAATGTTGTGTTGCCGGAGACCTGTAT 
TTCTCAACAAGGTAAGATGGTATCCTAGCAACTGCGGATTTTAATACATT 
TTCAGCAGAAGTACTTAGTTAATCTCTACCTTTAGGGATCGTTTCATCAT 

2401 TTTTAGATGTTATACTTGAAATACTGCATAACTTTTAGCTTTC^TGGGTT 
CCTTTTTTTCZAGCCTTTAGGAGACTGTTAAGCAATTTGCTGTCCAACTTT 
TGTGTTGGTCTTAAACTGCAATAGTAGTTTACCTTGTATTGAAGAAATAA 

2551 AGACC ATTTTT ATATTAAAAAAT ACTTTTGTCTGTCTTC ATTT TGACTTG 
TCTGATATCCTTGCAGTGCCCATT ATGTCAGTTCTGT CAGATATT CAGAC 
ATCAAAACTTAACGTGAGCTCAGTGGAGTTACAGCTGCGGTTTTGATGCT 

2701 GTTATTATTTCTGAAACTAGAAATGATGTTGTCTTCATCTGCTCATCAAA

CACTTCATGCAGAGTGTAAGGCTAGTGAGAAATGC^TACATTTATTGATA 
CTTTTTTAAAGTCAACTTTTTATCAGATTTTTTTTTCATTTGGAAATATA 

2851 TTGTTTTCTAGACTGCATAGCTTCTGAATCTGAAATGCAGTCTGATTGGC

ATGAAGAAGCACAGCACTCTTCATCTTACTTAAACTTCATTTTGGAATGA 
AGGAAGTTAAGCAAGGGCACAGGTCCATGAAATAGAGACAGTGCGCTCAG 

3001 GAGAAAGTGAACCTGGATTTCTTTGGCTAGTGTTCTAAATCTGTAGTGAG 
GAAAGTAACACCCGATTCCTTGAAAGGGCTCCAGCTTTAATGCTTCCAAA 
TTGAAGGTGGCAGGCAACTTGGCCACTGGTTATTTACTGCAT7ATGTCTC 

3151 AGTTTCGCAGCTAACCTGGCTTCTCCACTATTGAGCATGGACTATAGCCT 
GGCTTCAGAGGCCAGGTGAAGGTTGGGATGGGTGGAAGGAGTGCTGGGCT 
GTGGCTGGGGGGACTGTGGGGACTCCAAGCTGA.GCTTGGGGTGGGCAGCA 

3301 CAGGGAAAAGTGTGGGTAACTATTTTTAAGTACTGTGTTGC-AACGTCTC
ATCTGCAAATACGTAGGGTGTGTACTCTCGAAGATTAACAGTGTGGGTTC 
AGTAATATATGGATGAATTCACAGTGGAAGCATTCAAQGGTA.GATCATCT 

3451 AACGACACCAGATCATCAAGCTATGATTGGAAGCGGTATCAGAA.GAGCGA 
GGAAGGTAAGCAGTCTTCATATGTTTTCCCTCCACGTAAAGCAGTCTGGG 
AAAGT AGCACC C CTTGAGC AGAGACAAGGAAATAAT7 CAGGAGCATGTGC 

3601 TAGGAGAACTTTCTTGCTGAATTCTACTTGCAAGAGCTTTGATGCCTGGC
TTCTGGTGCCTTCTGCAGCACCTGCAAGGCCCAGAGCCTGTGGTGAGCTG 
G AGGGAAAG ATTC TGCTCAAGTCCAAGCTT C AG CAGGTCATTG T CTTTGC 

3751 TTCTTCCCCCAGCACTGTGCAGCAGAGTGGAACTGATGTCGAAGCCTCCT 
GTCCACTACCTGTTGCTGCAGGCAGACTGCTCTCAGAAAAAGAGAGCTAA 
CTCTATGCCATAGTCTGAAGGTAAAATGGGTTTTAAAM 

3901 AGGCAAAACCGGGTGCCCCATGAGAAGAAAGCAGTGGTAAACATGGTAGA
AAAGGTGCAGAAGCCCCCAGGCAGTGTGACAGGCCCCTCCTGCC^CCTAG 
AGGCGGGAACAAGCTTCCCTGCCTAGGGCTCTGCCCGCGAAGTGCGTGTT 

4051 TCTTTGGTGGGTTTTGTTTGGCGTTTGGTTTTGAGATTTAGACACAAGGG 
AAGCCTGAAAGGAGGTGTTGGGCACTATTTTGGTTTGTAAAGCCTGTACT
TCAAATATATATTTTGTGAGGGAGTGTAGCGAATTGGCCAATTTAAAATA 

4201 AAGTTGCAAGAGATTGAAGGCTGAGTAGTTGAGAGGGTAACACGTTTAAT
GAGATCTTCTGAAACTACTGCTTCTAAACACTTGTTTGAGTGGTGAGACC 
TTGGATAGGTGAGTGCTCTTGTTACATGTCTGATGCACTTGCTTGTCCTT 

4351 TTCCATCCACATCCATGCATTCCACATCCACGCATTTGTCACTTATCCCA 
TATCTGTCATATCTGACATACCTGTCTCTTCGTCACTTGGTCAGAAGAAA 
CAGATGTGATAATCCCCAGCCGCCCCAAGTTTGAGAAGATGGCAGTTGCT 

4501 TCTTTCCCTTTTTCCTGCTAAGTAAGGATTTTCTCCTGGCTTTGACACCT 
CACGAAATAGTCTTCCTGCCTTACATTCTGGGCATTATTTCAAATATCTT 



FIG. 6B 



WO 03/024199 PCT/US02/30156




TGGAGTGCGCTGCTCTCAAGTTTGTGTCTTCCTACTCTTAGAGTGAATGC 

4651 TCTTAGAGTGAAAGAGAAGGAAGAGAAGATGTTGGCCGGi 3TTCTCTGAT 
GAACACACCTCTGAATAATGGCCAAAGGTGGGTGGGTTTCTCTGAGGAAC 
GGGCAGCGTTTGCCTCTGAAAGC2lAGGAGCTCTGCGGAGTTGCAGTTATT 

4801 TTGCAACTGATGGTGGAACTGGTGCTT.AAAGCnGATTCCCTAGGTTCCCT 
GCTACTTCTTTTCCTTCTTGGCAGTCAGTTTATTTCTGACAGACAAACAG 
CCACCCCCACTGCAGGCTTAGAAAGTATGTGC-CTCTGCCTC-GGTGTGTTA 

4951 CAGCTCTGCCCTGGTGAAAGGGGATTA^AACGGGCACCATTCATCCCAAA
CAGGATCCT CATT CATGGATCAAGCTGT.-AGGAACTTGGGCTCCAACCTC 
AAAACATTAATTGGAGT ACGAATGT AATTAAAACTGCATT ct CGCATTCC 

5101 TAAGTCATTTAGTCTGGACTCTGCAGCATGTAGGTCGGCAGCTCCCACTT 
TCTCAAAGACCACTGATGGAGGAG7AGT AAAAATGGAGAC CGATT CAGAA 
CAACCAACGGAGTGTTGCCGAAGAAACTGATGGAAATAATGCATGAATTG 

5251 TGTGGTGGACATTTTTTTTAAATACATA £ -ACTACTTCAAATGAGGTCGGA 
GAAGGTCAGTGTTTTATTAGCAGCCATAAAACCAGGTGAGCGAGTACCAT 
TTTTCTCTACAAGAAAAACGATTCTGAGCTCTGCGTA^GTATAAGTTCTC 

5401 CATAGCGGCTGAAGCTCCCCCCTGGCTGCCTGCCATCTCAGCTGGAGTGC 
AGTGCCATTTCCTTGGGGTT TCT CTC^CAGC AGTAATGGGACAAT ACTT C 
ACAAAAATTCTTTCTTTTCCTGTCATGTGGGATCCCTACTGTGCCCTCCT 

5551 GGTTTT ACGTTAC CC C CTGACTGTTCCA7TC-.GCGGTTTGGAAAGAGAAA 
AAGAATTTGGAAATAAAACATGTCTACGTTATCACCTCCTCCAGCATTTT 
GGTTTTT AATT ATGT C AAT AACTGGCTT AGA7 TTGGAAAT GAGAGGGGGT 

5701 TGGGTGT ATTACCGAGGAACAAAGGAAGGCTT ATATAAACXC AAGTCTTT 
TATTTAGAGAACTGGCAAGCTGTCAAAA^C-AAAAGGCCTIACCACCAAA 
TTAAGTGAATAGCCGCTATAGCC^GC^GGGCCAGCACGAGGGATGGTGCA 

5851 CTGCTGGCACTATGCCACGGCCTGCl'TGTGACTCTGAGAGCAACTGCTTT 
GGAAATGACAGCACTTGGTGCAATTTCCTTTGITTCAGAATGCGTAGAGC 
GTGTGCTTGGCGACAGTTTTTCTAGTTAGGCCACTTCTTTTTTCCTTCTC 

6001 TCCTCATTCTCCTAAGCATGTCTCCATGCTGC-TAATCCCAGTCAAGTGAA 
CGTTCAAACAATGAATCCATCACTGTAGGATTCTCGTGGTGATCAAATCT 
TTGTGTGAGGTCTATAAAATATGG. a AGCTTATT7ATTTTTCGTTCTTCCA 

6151 TATCAGTCTTCTCTATGACAATTCACATCC^XCACAGCAAATTAAAGGTG 
AAGGAGGCTGGTGGGATGAAGAGGGTCTTCTAGCTTTACGTTCTTCCTTG 
CAAGGCCACAGGAAAATGCTGAGAGCTGTAGAATACAGCCTGGGGTAAGA 

6301 AGTTCAGT CT CCTGCTGGGACAGCT AACCGGAI CTT AT AACCCCTTCTGA 
GACT CATCTT AGGAC CAAAT AGGGTCT ATCTGGGGTTTTTGTT C CTGCTG 
TTCCTCCTGGAAGGCTATCTCACTA7TTCACTGCTCCCACG3TTACAAAC 

6451 CAAAGATACAGCCTGAATTTTTTCTAGGCC^C^TTACATAAATTTGACCT 
GGTACCAATATTGTTCTCTATATAGTTATTTCCTTCCCCACTGTGTTTAA 
CCCCTTAAGGCATTCAGAACAACTAGA^T CAT AGAATGGT TTGGATTGGA 

6601 AGGGGCCTTAAACATCATCCATTTCCAACCCTCTGCCATGGGCTGCTTGC 
CACCCACTGGCTC^GGCTGCCC^GGGCCCC^TCCAGCCTG'GCCTTGAGCA 
CCTCCAGGGATGGGGCACCCACAGCTTCTCTGGGCAGCCTGTGCCAACAC 

6751 CTCACCACTCTCTGGGTAAAGAATTCTCTTTT.AACATCTAATCTAAATCT 
CTTCTCTTTTAGTTTAAAGCCATTCCTCTTTTTCCCGTTGCTATCTGTCC 
AAGAAATGTGTATTGGTCTCCCTCCTGCTTATAAGCAGGAAGTACTGGAA 

6901 GGCTGCAGTGAGGTCTCCCCACAGCCTTCTCTTCTCCAGGCTGAACAAGC 



FIG. 6C 






CCAGCTCCTTCAGCCTGTCTTCGTAC-GAGATCATCTTAGTGGCCCTCCTC 
TGGACCCATTCCAACAGTTCCACGGC7TTCTTGTGGAGCCCCAGGTCTGG 

7051 ATGCAGTACTTCAGATGGGGCCTTACAAAGGCAGAGCAGATGGGGACAAT 
CGCTTACCCCTCCCTGCTGGCTGCCCCTGTTTTGATGCAGCCCAGGGTAC 
TGTTGGCCTTTCAGGCTCCCAGACCCCTTGCTGATTTGTGTCAAGCTTTT 

7201 CATCCACCAG AACCC ACGCTTCCTGGTTAZVTACTTCTGCCCTCACTTCTG 
TAAGCTTGTTTCAGGAGACTTCCATTCTTTAGGACAGACTGTGTTACACC 
TACCTGCCCTATTCTTGCATATATACATTTCAGTTCATGTTTCCTGTAAC 

7351 AGG ACAGAATATGTATTCCT CT AAC a AAAA.T ACATGCAGAATTC CT AGTG 
CCATCTCAGTAGGGTTTTCATGGCAGTATTAGCACATAGTCAATTTGCTG 
CAAGTACCTT CCAAGCTGCGGCCTCC CATAAAT CCTGTATTTGGGATCAG 

7501 TTACCTTTTGGGGTAAGCTTTTGTATCTGCAGAGACC CTGGGGGTT CTGA 
TGTGCTTCAGCTCTGCTCTGTICTGACTGCACCATTTTCTAGATCACCCA 
GTTGTTCCTGTACAACTTCCTTGXCCTCCATCCTTTCCCAGCTTGTATCT 

7651 TTGACAAATACAGGCCTATTT7TGTGTTTGCTTCAGCAGCCATTTAATTC 
TTCAGTGTCATCTTGTTCTGTTGATGCCACTGGAACAGGATTTTCAGCAG 
TCTTGCAAAGAACATCTAGCTGAAAACTTTCTGCCATTCAATATTCTTAC 

7801 CAGTTCTTCTTGTTTGAGGTGAGCCATAAATTACTAGAACTTCGTCACTG 
ACAAGTTTATGCATTTTATTACITCTATTATGTACTTACTTTGACATAAC 
ACAGACACGCACATATTTTGCTGGGATTTCCACAGTGTCTCTGTGTCCTT 

7951 *' CACATGGTTTTACTGT CAT ACTTCCGTTATAACCTTGGCAAT CTGCCCAG 
CTGCCCATCACAAGAAAAGAGATTCCTTTTTTATTACTTCTCTTCAGCCA 
AT AAACAAAATGTGAGAAGC CCA-AC AAG AACTTGTGGGGCAGGCTGC CA 

8101 TCAAGGGAGAGACAGCTGAAGGGTTGTGTAGCTCAATAGAATTAAGAAAT 
AATAAAGCTGTGTCAGACAGTirTGCCTGATTTATACAGGCACGCCCCAA 
GCCAGAGAGGCTGTCTGCCAAGGCCACCTTGCAGTCCTTGGTTTGTAAGA 

8251 TAAGTCATAGGTAACTTTTCTGGTGAATTGCGTGGAGAATCATGATGGCA 
GTTCTTGCTGTTTACTATGGTA-.GATGCTAAAATAGGAGACAGCAAAGTA 
ACACTTGCTGCTGTAGGTGCTC7GCTATCCAGACAGCGATGGCACTCGCA 

8401 CACCAAGATGAGGGATGCTCCC-.GC T GACGGATGCTGGGGCAGTAACAGT 
GGGTCCCATGCTGCCTGCTCATTA3CATCACCTCAGCCCTCACCAGCCCA 
TCAGAAGGATCATCCCAAGCTGAGGA^AGTTGCT CATCTTCT TCACATCA 

8551 TCAAACCTTTGGCCTGACTGATG2CTCCCGGATGCTTAAATGTGGTCACT 
GACAT CTTTATTTTT CTATG ATT TCAAGTCAG AACCT CCGGAT CAGGAGG 
GAACACATAGTGGGAATGTAC CCTCAGCT CCAAGGCCAGATCTT C CTTCA 

8701 ATGATCATGCATGCTACTTAGGA-JGGTGTGTGTGTGTGAATGTAGAATTG 
CCTTTGTTATTTTTTCTTCCTGCTGTC^GGAACATTTTGAATACCAGAGA 
AAAAGAAAAGTGCT CIT CTTGGCATGGGAGGAGTTGT CACACTTGCAAAA 

8851 TAAAGGATGCAGTCCCAAATGTTO.TAATCTCAGGGTCTGAAGGAGGATC 
AGAAACTGTGTATACAATTTOVGGCTTCTCTGAATGCAGCTTTTGAAAGC 
TGTTCCTGGCCGAGGCAGTACTAGTWGAACCCTCGGAAACAGGAACAAA 

9001 TGTCTTCAAGGTGCAGCAGGAGGAA^CAC CTTGC CCAT CATGAAAGTGAA 
TAACCACTGCCGCTGAAGGAATCC a .GCTCCTGTTTGAGCAGGTGCTGCAC 
ACTCC CACACTGAAACAACAGTTCATTTTT AT AGGACTTCCAGGAAGGAT 

9151 CTTCTTCTTAAGCTT CTTAATTATGGTACATCTCCAGTTGGCAGATGACT 
ATGACTACTGACAGGAGAATGAGGAACTAGCTGGGAATATTTCTGTTTGA 
CCACCATGGAGTCACCCATTTCTTTACTGGTATTTGGAAATAATAATTCT 
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9301 GAATTGCAAAGCAGGAGTTAGCGAAGATCTTCATTTCTTCCATGTTGGTG 
ACAGCACAGTTCTGGCTATGAAAGTCTGCTTACAAGGAAGAGGATAAAAA 

9 d o 1 TCATnGGGATAATAA\TCTAAGT*TGAAGACAATGAGGTTTTAGCTGCAT 
TTGACATGAAGAAATTGAGACerCTACTGGATAGCTATGGTATTTACGTG 
TCITT T TGCTTAGTTACTTATTGACCCCAGCTGAGGTCAAGTATGAACTC 
9551 AGGTCTCTCGGGCTACTGGCATGGATTGATTACATACAACTGTAATTTTA 
GC^GTGATTTAGGGTTTATGAGTACTTTTGCAGTAAATCATAGGGTTAGT 
AATGTTAATCTCAGGGAAAAAAAAAAAAAGCCAACCCTGACAGACATCCC 
9701 AGCTCaiGGTGGAAATCAAGGATCACAGCTCAGTGCGGTCCCAGAGAACAC 
AGGGACTCTTCTCTTAGGACCTTTATGTACAGGGCCTCAAGATAACTGAT 
GTT B GTC AG AAGACTTTCCATTCTGGCCACAGTTC AGCTGAGGC AATCCT 
9851 GGAATTTTCTCTCCGCTGCACAGTTCCAGTCATCCCAGTTTGTACAGTTC 
TGGCACTTTTTGGGTCAGGCCGTGATCCAAGGAGCAGAAGTTCCAGCTAT 
GGTCA GGG AGTGCCTGACCGTCCCAACTCACTGC ACT C AAAC AAAGGCG A 
10001 AACCACAAGAGTGGCTTTTGTTGAAATTGCAGTGTGGCCCAGAGGGGCTG 
CACCAGTACTGGATTGACCACGAGGCAACATTAATCCTCAGCAAGTGCAA 
TTTGCAGCCATTAAA.TTGAACTAACTGATACTACAATGCAATCAGTATCA 
10151 ACAAGTGGTTTGGCTTGGAAGATGGAGTCTAGGGGCTCTACAGGAGTAGC
XACTCTCTAATGGAiGTTGCATTTTGAAGCAGGACACTGTGAAAAGCTGGC 
CTCCTAAAGA.GGCTGCT.AAACATTAGGGTCAATTTTCCAGTGCACTTTCT 
10301 GAAGTGTCTGCAGTTCCCCATGCAAAGCTGCCCAAACATAGCACTTCCAA 
TTG* 2.TACAATT ^ATC-CAGGCGTACTGCTTCTTGCCAGCACTGTCCTTC 
TCAAATG a! ^CTCAACAAACA i \.TTTCAAAGTCTAGTAGAAAGTAACAAGCT 
10451 TTGAA T GTC^TTAAftAAGTATATCTGCTTTCAGTAGTTCAGCTTATTTAT 
GCCCACTBGAAAC^CTTGTACAAGCTGAACACTGGGGCTCCAGATTAGT 
GGTAAAACCTACTTTATACAATCATAGAATCATAGAATGGCCTGGGTTGG 
10601 AAGGGACCCCAAGGATCATGAAGATCCAACACCCCCGCCACAGGCAGGGC 
CACCAACCTCCAGATCTGGTACTAGACCAGGCAGCCCAGGGCTCCATCCA 
ACCTGGCCATGAACACCTCCAGGGATGGAGCATCCACAACCTCTCTGGGC 
10751 AGCCTGTGCCAGCACCTCACaVCCCTCTCTGTGAAGAACTTTTCCCTGAC 
ATCC^aTCTAAGCCTTCCCTCCTTGAGGTTAGATCCACTCCCCCTTGTGC 
TATCA^GTCT^CTCTTGTAAAAAGTTGATTCTCCTCCTTTTTGGAAGGT 
10901 TGCAATGAGGTCTCCTTGCAGCCTTCTTCTCTTCTGCAGGATGAACAAGC 
CCAGCTCCCTGAGCCTGTCTTTATAGGAGAGGTGCTCCAGCCCTCTGATC 
ATCTTTGTGGCCCTGCiCTGGACCCGCTCGAAGAGCTCCACATCTTTCCT 
11051 GTACTGGGGGCCCCAGGCCTGAATGCAGTACTCCAGATGGGGCCTCAAAA
GAGCAGAGTAAAGAGGGACAATCACCTTCCTCACCCTGCTGGCCAGCCCT
CTTCTGATGGAGCCCTGGATACAACTGGCTTTCTGAGCTGCAACTTCTCC
11201 TTATCAGTTCCACTATTAAAACAGGAACAATACAACAGGTGCTGATGGCC
AGTGCAGAGTTTTTCACACTTCTTCATTTCGGTAGATCTTAGATGAGGAA
CGTTGAAGTTGTGCTTCTGCGTGTGCTTCTTCCTCCTCAAATACTCCTGC
11351 CTGATACCTCACCCCACCTGCCACTGAATGGCTCCATGGCCCCCTGCAGC
CAGGGCCCTGATGAACCCGGCACTGCTTCAGATGCTGTTTAATAGCACAG
TATGACCAAGTTGCACCTATGAATACACAAACAATGTGTTGCATCCTTCA
11501 GCACTTGAGAAGAAGAGCCAAATTTGCATTGTCAGGAAATGGTTTAGTAA
TTCTGCCAATTAAAACTTGTTTATCTACCATGGCTGTTTTTATGGCTGTT
AGTAGTGGTACACTGATGATGAACAATGGCTATGCAGTAAAATCAAGACT 
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11 651 GT^ATATTGCAACAGACTATAAAATTCCTCTGTGGCTTAGCCAATGTGG 
TACTTCCCACATTGTATAAGAAATTTGGCAAGTTTAGAGCAATGTTTGAA
GTGTTGGGAAATTTCTGTATACTCAAGAGGGCGTTTTTGACAACTGTAGA

11801 ACAGAGGAATCAAAAGGGGGTGGGAGGAAGTTAAAAGAAGAGGCAGGTGC
AAGAGAGCTTGCAGTCCCGCTGTGTGTACGACACTGGCAACATGAGGTCT
TTGCTAATCTTGGTGCTTTGCTTCCTGCCCCTGGCTGCCTTAGGGTGCGA

11951 TCTGCCTCAGACCCACAGCCTGGGCAGCAGGAGGACCCTGATGCTGCTGG
CT C-.GATGAGGAGAAT c AGC CTGTTT AGCTGC CTGAAGGAT AGGCACGAT 
TTTGGCTTTCCTCAAGAGGAGTTTGGCAACCAGTTTCAGAAGGCTGAGAC 

12101 CATCCCTGTGCTGCACGAGATGATCCAGCAGATCTTTAACCTGTTTAGCA 
CCA2.GGATAGCAGCGCTGCTTGGGATGAGACCCTGCTGGATAAGTTTTAC 
ACCG^GCTGTACCAGCAGCTGAACGATCTGGAGGCTTGCGTGATCCAGGG 

12251 CGTGGGCGTGACCGAGACCC CT CTGATGAAGGAGGAT AGC ATCCTGGCTG 
TGAGGAAGTACTTTCAGAGGATCACCCTGTACCTGAAGGAGAAGAAGTAC 
AGC CCCTGCGCTTGGGAAGT CGTGAGGGCTGAG AT CATGAGGAGCTTTAG 

12401 CCTGAGCACCAACCTGCAAGAGAGCTTGAGGTCTAAGGAGTAAAAAGTCT
AGac-TCGGGGCGGCGCGTGGTAGGTGGCGGGGGGTTCCCAGGAGAGCCCC 
CAGCGCGGACGGCAGCGCCGTCACTCACCGCTCCGTCTCCCTCCGCCCAG 

12551 GGTCGCCTGGCGC AACCGCTGCAAGGGCACCGACGTCCAGGCGTGGATCA 
GSVC-C^GCCGGCTGTGAGGAGCTGCCGCGCCCGGCCCGCCCGCTGCACAG 
" CCG^CGCTTTGCGAGCGCGACGCTACCCGCTTGGCAGTTTTAAACGCAT 

12701 CCC^CATTAAAACGACTATACGCAAACGCCTTCCCGTCGGTCCGCGTCTC 
TT t CCGCCGCCAGGGCGACACTCGCGGGGAGGGCGGGAAGGGGGCCGGGC 
GGGiGCCCGCGGCCAACCGTCGCCCCGTGACGGCACCGCCCCGCCCCCGT 

12851 GACGCGGTGCGGGCGCCGGGGCCGTGGGGCTGAGCGCTGCGGCGGGGCCG
GGCCGGGCCGGGGCGGGAGCTGAGCGCGGCGCGGCTGCGGGCGGCGCCCC 
CTC-GGTGCAATATGTTCAAGAGAATGGCTGAGTTCGGGCCTGACTCCGG 

13001 GGGCAGGGTGAAGGTGCGGCGCGGGCGGAGGGACGGGGCGGGCGCGGGGC
CG r CCGGCGGGTGCCGGGGCCTCTGCCGGCCCGCCCGGCTCGGGCTGCTG 
CCKSCGCTTACGGGCGCGCTTCTCGCCGCTGCCGCTTCTCTTCTCTCCCGC 

13151 GCAAGGGCGTCACCATCGTGAAGCCGGTAGTGTACGGGAACGTGGCGCGG
TAC--CGGGAAGAAGAGGGAGGAGGACGGGCACACGCATCAGTGGACGGT 
TTA^GTGAAGCCCTACAGGAACGAGGTAGGrGC CCGAGCGCGT CGGCCGCC 

13301 GTTCTCGGAGCGCCGGAGCCGTCAGCGCCGCGCCTGGGTGCGCTGTGGGA

cac-^gagcttctctcgtaggacatgtccgcctacgtgaaaaaaatcca 
gtt^Lagctgcacgagagctacgggaatcctctccgaggtgggtgttgcg 
13*51 xcgg^ggtttgctccgctcggtcccgctgaggctcgtcgccctcatctt • 

TCT^CGTGCCGCAGTCGTTACCAAACCGCCGTACGAGATCACCGAAACG 
13551 GGC^GGGGCGAATTTGAAATCATCATCAAGATATTTTTCATTGATCCAAA 
CG^-'-G^CCCGTAAGTACGCTCAGCTTCTCGTAGTGCTTCCCCCGTCCTG 

gcg. : --ccggggctgggctgctcgctgctgccggtcacagtcccgccagcc 

13 701 GCC^AGCTGACTGAGCTCCCTTTCCCGGGACGTGTGCTCTGTGTTCGGTC 
AGCG^GGCTATCGGGAGGGCTTTGGCTGCATTTGGCTTCTCTGGCGCTTA 
GCGCiGGAGCACGTTGTGCTACGCCTGAACTACAGCTGTGAGAAGGCCGT 

13851 GG a - iCCGCTCTCAAACTGATTTATTGGCGAAATGGCTCTAAACTAAATC 
GTCTCCTCTCTTTGGAAATGCTTTAGAGAAGGTCTCTGTGGTAGTTCTTA 
TGCATCTATCCTAAAGCACTTGGCCAGACAATTTAAAGACATCAAGCAGC 

FIG. 6F 






14001 ATTTATAGCAGGCACGTTTAATAACGAATACTGAATTTAAGTAACTCTGC
TCACGTTGTATGACGTTTATTTTCGTATTCCTGAAAGCCATTAAAATCCT 
GTGCAGTTGTTTAGTAAGAACAGCTGCCACTGTTTTGTATCTAGGAGATA 
14151 ACTGGTGTTTCCCTACAGTTCTCAAGCTGATAAAACTCTGTCTTTGTATC
TAGGTAACCCTGTATCACTTGCTGAAGCTTTTTCAGTCTGACACCAATGC 
AATCCTGGGAAAGAAAACTGTAGTTTCTGAATTCTATGATGAAATGGTAT 
14301 GAAAATTTTAATGTCAACCGAGCCTGACTTTATTTAAAAAAAATTATTGA
TGGTGCTGTGTATTTTGGTCCTTCCTTAGATATTTCAAGATCCTACTGCC 
ATGATGCAGCAACTGCTAACGACGTCCCGTCAGCTGACACTTGGTGCTTA 
14451 CAAGCATGAAACAGAGTGTAAGTGCAAAATGAGGATACCTTCGCCGACCG
TCATTCACTACTAATGTTTTCTGTGGGATGTGATCGTACAGTGAGTTTGG 
CTGTGTGAAATTTGAATAGCTTGGTATTGGCAGTGATGACGTGATCGATG 
14601 CCTTGCTTATCATGTTTGAAATGAAGTAGAATAAATGCAGCCTGCTTTAT
TTGAGATAGTTTGGTTCATTTTATGGAATGCAAGCAAAGATTATACTTCC 
TCACTGAATTGCACTGTCCAAAGGTGTGAAATGTGTGGGGATCTGGAGGA 
14751 CCGTGACCGAGGGACATTGGATCGCTATCTCCCATTTCTTTTGCTGTTAC
CAGTTCAGATTTTCTTTTCACCTAGTCTTTAATTCCCAGGGTTTTGTTTT
TTCCTTGGTCATAGTTTTTGTTTTTCACTCTGGCAAATGATGTTGTGAAT 
14901 TACACTGCTTCAGCCACAAAACTGATGGACTGAATGAGGTCATCAAACAA
ACTTTTCTTCTTCCGTATTTCCTTTTTTTTCCCCCACTTATCATTTTTAC 
TGCTGTTGTTGAGTCTGTAAGGCTAAAAGTAACTGTTTTGTGCTTTTTCA 
15051 GGACGTGTGCTTTCCAAATTACTGCCACATATATAAAGAAAGGTTGGAAT 
TTTAAAGATAATTCATGTTTCTTCTTC1TTTTTGCCACCACAGTTGCAGA 
TCTTGAAGTAAAAACCAGGGAAAAGCTGGAAGCTGCCAAAAAGAAAACCA 
15201 GTTTTGAAATTGCTGAGCTTAAAGAAAGGTTAAAAGCAAGTCGTGAAACC 
ATCAACTGCTTAAAGAGTGAAATCAGAAAACTCGAAGAGGATGATCAGTC 
TAAAGATATGTGATGAGTGTTGACTTGGCAGGGAGCCTATAATGAGAATG 
15351 AAAGGACTTCAGTCGTGGAGTTGTATGCGTTCTCTCCAATTCTGTAACGG
AGACTGTATGAATTTCATTTGCAAATCACTGCAGTGTGTGACAACTGACT
TTTTATAAATGGCAGAAAACAAGAATGAATGTATCCTCATTTTATAGTTA 
15501 AAATCTATGGGTATGTACTGGTTTATTTCAAGGAGAATGGATCGTAGAGA 
CTTGGAGGCCAGATTGCTGCTTGTATTGAClGCATTTGAGTGGTGTAGGA 
ACATTTTGTCTATGGTCCCGTGTTAGTTTACAGAATGCCACTGTTCACTG 
15651 TTTTGTTTTGTATTTTACTTTTTCTACTGCAACGTCAAGGTTTTAAAAGT
TGAAAATAAAACATGCAGGTTTTTTTTAAATATTTTTTTGTCTCTATCCA 
GTTTGGGCTT CAAGT ATTATTGTT AACAGCAAGTCCTGATTT AAGTCAGA 
15801 GGCTGAAGTGTAATGGTATTCAAGATGCTTAAGTCTGTTGTCAGCAAAAC 
AAAAGAGAAAACTTCATAAAATCAGGAAGTTGGCATTTCTAATAACTTCT 
TTATCAACAGATAAGAGTTTCTAGCCCTGCATCTACTTTCACTTATGTAG 
15951 TTGATGCCTTTATATTTTGTGTGTTTGGATGCAGGAAGTGATTCCTACTC 
^q^^^ j^q^<j^TCTATTT AACACTTGTACTCTGCTGTGCTTAGCCTT 
16051 TCCCCATGAAAATTCAGCGGCTGTAAATCCCCCTCTTCTTTTGTAGCCTC 
ATACAGATGGCAGACCCTCAGGCTTATAAAGGCTTGGGCATCTTCTTTAC 
TGCTTTGAGATTCTGTGTTGCAGTAACCTCTGCCAGAGAGGAGAAAAGCC 
16201 CCACAAACCTCATCCCCTTCTTCTATAGCAATCAGTATTACTAATGCTTT
GAGAACAGAGCACTGGTTTGAAACGTTTGATAATTAGCATTTAACATGGC 
TTGGTAAAGATGCAGAACTGAAACAGCTGTGACAGTATGAACTCAGTATG 
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16351 GAGACTTCATTAAGACAAACAGCTGTTAAAATCAGGCATGTTTCATTC-AG 
GAGG^CGGGGCAACTTGCACCAGTGGTGCCCACACAAATCCTTCCTGC-CG 
CTGCAGACCAATTTTTCTGGCATTCTGACTGCCGTTGCTGCTGGTCACAG 

16501 AGAGCAACT ATTTTTATCAGCCACAGGCAATTTGCTTGT AGT ATTT7CCA 
AGTGTTGTAGGTAAGTATAAATGCATCGGCTCCAGAGCACTTTGAGTATA 
CTTATTAAAAACATAAATGAAAGACAAATTAGCTTTGCTTGGGTGCACAG 

16651 AACATTTTT AGTTCCAGCCTGCTTTTTGGTAGAAGCCCTCTTCTGAGGCT 
AGAACTGACTTTGACAAGTAGAGAAACTGGCAACGGAGCTATTGCTATCG 

16751 AAGGATCCTTGTT AACAAAGTT AATCGTCTTTT AAGGTTTGGTTTATTCA 
* r^Tjp.rrci ana AfSAACGTGCTGTCTTCCATG 



TTGAAACTCGATGATCACTGTGGTCCTTTTCAACCCAGGCTATTCTATGA 
TTCTATGATTCAACAGCAAATCATATGTACTGAGAGAGGAAACAAACACA 
AGTGCTACTGTTTGCAAGTTTTGTTCATTTGGTAAAAGAGTCAGGTTTTA 
17301 AAATTCAAAATCTGTCTGGTTTTGGTGTTTTTTTTTTTTTATTTATTATT 
* rTi^(-vT"T"raTPTTTPTr'TCCCAGGACTGTGTG. : i 



AAAGCCTATTGAAAGCCGAGATTAAGCCACCAAAAGAACTCATTTGC^T 
17901 GGAATATGTAGTATTTGCCCTCTTCCTCCCGGGTAATTACTATACTTTAT 

AGGGTGCTTATATGTTAAATGAGTGGCTGGCACTTTTTATTCTCAC-.GCT 

GTGGGGAATTCTGTCCTCTAGGACAGAAACAATTTTAATCTG^CCACTC- 
18051 GTGACTGCTTTGTCAGCACTTCCACCTGAAGAGATCAATACACTC^CAA 

uivsn a a r A rTTGGCAAACCTCACATCTTATTTCATACTCXC 
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GGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCG CT CACTGACTCGCT 

gcgctcggtcgttcggctgcggcgagcggtatcag:tc.-.ctcaaaggcgg 

18801 TA*TACGGTTATCCACAGAATCAGC^^^ 

GCAA^GGCCAGCAA^GGCCAGGAACCGTAAAA^----^-CGTTGCTGGC 

GTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATC-.CA-AA^TCGACGCT 

18951 CAAGTCAGAGGTGGCGAAACCCGACAGGACTATAA.-.3.-.1ACCAGGCGTTT 

CC-GATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTC-GCGCTTTCTCATA 

19101 gct'cslCgctgtaggtatctcagttcggtgtaggtcgttcgctccaagctg 

GGCVGTGTGCACGAACCCCCCGTTCAGCCCGACCC<-GC^ 
T A 1 CT AT CGTCTTGAGT CC AACCCGGT AAG ACACGA C i ? AT CGCCACTGG 
19251 C AGC AGC CACTGGT AAC AGG ATT AGCAGAGCGAGGT .-. i GTAGGCGGTGCT 
ACAg\gTTCTTGAAGTGGTGGCCTAACTACGGCTACAC7AGAAGGACAGT 

ATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTC3GAAAAA 
19^01 G T 2.GCTCTTGATCCGGC AAACAAACCACCGCTGG i .-.^--GTGGTTTTTTT 

■ 01 gStSSgcagcagattacgcgcagaaaa^ 

T TT GATCTTTTCTACGGGGTCTGACGCTCAGTGG.-_-.-.r. J AAA.CTCACGTT 

19 551 AAGGG ATTTTGGT C ATG AGATT AT C AAAAAGGATCT , Z.-.CCTAGATCCTT 
^^^^^^y^^Q^QTTTTAAATCAATCTAAAGr.-.r.-.TATGAGTAAAC 
TTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGA 

19701 TCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATA 

19701 Ic^cSScGGGAGGGCTTACCA^ 

GCGAGACCCACGCTCACCGGCTCCAGATTTATCAG^^-"---^^^^^^^® 

19851 CCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAAC^ - "iE^SSraCTTftA 
CAGTCTATTAATTGTTGCCGGGAAGCTAGAGT^G.. : ..^G^T^ 

TAGTTTGCGCAACGTTGTTGCCATTGCTAC^GGC^.-jiG^TGTCAOTCT 



rCCTTCGGTCC 



TCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTA7CACTCATGGTTA 
20151 TC^GCACTGCATAATTCTCTTACTGTCATGCCA -C C C-T AAGATGCTTT 

GCGACCGAGTTGCTCTTGCCCGGCGTCAA^CGGG.^-^-j^AC^^^^^ 
20301 ATAGCAGAACTTTAAAAGTGCTCATCATTGGAAA^ - ^"OGGGGCGA 
WACTCTCAAGGATCTTACCGCTGTTGAGATCCAG ^"^^g^ 
TrrTOrACCCAACTGATCTTCAGCATCTTTTACTT i _-.~CAGCGTTTCTG 
20451 GGTGAGCAAAAACAGGAAGGCAAAATGCCGCAA^ 

ACACGGAAATGTTGAATACTCATACTGTTCCTTT, I^™^ 
CATTT ATCAGGGTTATTGTCTCATGAGCGGATACA i A . iTGAATGTATTT 

206 oi Saaaaat^aaat^ 

CCTA^TTGTAAGCGTTAATATTTTGTTAAAATTCu--w-iTAAATTTTTGT 

tS^tcagctcattttttaaccaataggccgaaatcggcaaaatccctta 
20751 taaatcaaaagaatagaccgagatagggttgagtgtt 

ACA\GAGTCCACTATTAAAGAACGTGGACTCCAACGi^-AAGGGCGAAAA 

acc^t^gggcgat^ 
o qo i tSttggggtcgaggtgccgtaaagcactaaatcg:-^.ccctaaaggga 

2090 gcSSg^t^ 
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21051 GGTCACGCTGCGCGTAACCACCACACCCGCCGCGCTTAATGCGCCGCTAC 
AGGGCGCGTCCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGA 
TCGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGATGTGC 

21201 TGCAAGGCGATTAAGTTGGGTAACGCCa GGGTTTTCCCAGTCACGACGTT 
GTAAAACGACGGCCAGTGAATTGTAATACGACTCACTATAGGGCGAATTG 

213 01 GAGCTCCACCGCGGTGGCGGCCGCTCTAG 
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SEQIDNO:11 

rTACCGCMCCCCCCCTCGAGGTGAATATCCAAGAATGCAGAACTGCATGGAAAGCAGAGCTG 

cIggcacgSSS 

rAarrACCTGTCCCTTACTCCCCTCAGCATTCTGTGCTATTTAGGGTTCTACCAGAGTCCTT 
CTGACACTTGTCTCAAGCTATTAACCAAGTGTCCAGCCAAAATCGATGTCACAACTTGGGAA 

SSSSwgaa^^ 
gaIggSSgg^gccaLgagtg^ 

GCAACTTGGGGGCTATTGAGAAACAAGGAAGGACAAACAGCGTTAGGTCAT^ 

acacagccagggctgctcctctataaaaggggaagaaagaggctccgcagccatcacagacc 

CAGAGGGGACGGTCTGTGAATCAAGCTT 
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SEQ ID NO. 14 

ATGGCTTTGA CCTTTGCCTT ACTGGTGGCT CTCCTGGTGC TGAGCTGCAA GAGCAGCTGC 
TCTGTGGGCT GCGATCTGCC TCA 

SEQ ID NO. 15 

GACCCACAGC CTGGGCAGCA GGAGGACCCT GATGCTGCTG GCTCAGATGA GGAGAATCAG 
CCTGTTTAGC TGCCTGAAGG ATAGGCACGA TTTTGGCTTT 

SEQ ID NO. 16 

CTCAAGAGGA GTTTGGCAAC CAGTTTCAGA AGGCTGAGAC CATCCCTGTG CTGCACGAGA 
TG 

SEQ ID NO. 17 

TCCAGCAGAT CTTTAACCTG TTTAGCACCA AGGATAGCAG CGCTGCTTGG GATGAGACCC 
TGCTGGATAA GTTTTACACC GAGCTGTACC AGCA 

SEQ ID NO. 18 

CTGAACGATC TGGAGGCTTG CGTGATCCAG GGCGTGGGCG TGACCGAGAC CCCTCTGATG 
AAGGAGGATA GCATCCT 


SEQ ID NO. 19 

GCTCTGAGGA AGTACTTTCA GAGGATCACC CTGTACCTGA AGGAGAAGAA GTACAGCCCT 
TGCGCTTGGG AAGTCGTGAG GG 

SEQ ID NO. 20 

CTGAGATCAT GAGGAGCTTT AGCCTGAGCA CCAACCTGCA AGAGAGCTTG AGGTCTAAGG 
AGTAA 

SEQ ID NO. 21
IFN-1 

CCCAAGCTTT CACCATGGCT TTGACCTTTG CCTT 

SEQ ID NO. 22 
IFN-2b 

ATCTGCCTCA GACCCACAG
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SEQ ID NO. 23 
IFN-3c 

GATTTTGGCT TTCCTCAAGA GGAGTT 

SEQ ID NO. 24 
IFN-4b 

GCACGAGATG ATCCAGCAGA T 

SEQ ID NO. 25 
IFN-5 

ATCGTTCAGC TGCTGGTACA 

SEQ ID NO. 26 
IFN-6

CCTCACAGCC AGGATGCTAT 

SEQ ID NO. 27 
IFN-7 

ATGATCTCAG CCCTCACGAC 

SEQ ID NO. 28
IFN-2 

CTGTGGGTCT GAGGCAGAT 

SEQ ID NO. 29 
IFN-3b 

AACTCCTCTT GAGGAAAGCC AAAATC 

SEQ ID NO. 30 
IFN-4 

ATCTGCTGGA TCATCTCGTG C

SEQ ID NO. 31 
IFN-8 

TGCTCTAGAC TTTTTACTCC TTAGACCTCA AGCTCT 
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SEQ ID NO. 32 

Oligo 1. TCACTCGAGG TGAATATCCA AGAAT 
SEQ ID NO. 33 

Oligo 2. GAGATCGATT TTGGCTGGAC ACTTG 
SEQ ID NO. 34 

Oligo 3. CACATCGATG TCACAACTTG GGAAT 
SEQ ID NO. 35 

Oligo 4. TCTAAGCTTC GTCACAGACC GTCCC 
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SEQUENCE LISTING 

<110> AviGenics, Inc. 

<120> Production of Transgenic Avians 
Using Sperm-mediated Transfection

<130> 11106-021-228 

<140> To be assigned 
<141> 2002-09-18 

<150> 60/324,001 
<151> 2001-09-21 

<150> 60/323,961 
<151> 2001-09-21 

<160> 35 

<170> PatentIn version 3.1

<210> 1 

<211> 20 

<212> DNA

<213> Artificial sequence 
<220> 

<223> Primer 5pLMAR2 

<400> 1 

tgccgccttc tttgatattc 



<210> 2 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer LE-6.1kbrev1

<400> 2 

ttggtggtaa ggcctttttg 



<210> 3 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer lys-6.1 

<400> 3 

ctggcaagct gtcaaaaaca 






<210> 4

<211> 20 

<212> DNA 

<213> Artificial sequence
<220> 

<223> Primer LysE1rev

<400> 4 

cagctcacat cgtccaaaga 20 

<210> 5 

<211> 498 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> IFNMAGMAX
<220> 

<221> misc_feature 

<222> (1)..(498) 
<223> 

<400> 5 

tgcgatctgc ctcagaccca cagcctgggc agcaggagga ccctgatgct gctggctcag 60 

atgaggagaa tcagcctgtt tagctgcctg aaggataggc acgattttgg ctttcctcaa 120 

gaggagtttg gcaaccagtt tcagaaggct gagaccatcc ctgtgctgca cgagatgatc 180 

cagcagatct ttaacctgtt tagcaccaag gatagcagcg ctgcttggga tgagaccctg 240 

ctggataagt tttacaccga gctgtaccag cagctgaacg atctggaggc ttgcgtgatc 300 

cagggcgtgg gcgtgaccga gacccctctg atgaaggagg atagcatcct ggctgtgagg 360 

aagtactttc agaggatcac cctgtacctg aaggagaaga agtacagccc ctgcgcttgg 420 

gaagtcgtga gggctgagat catgaggagc tttagcctga gcaccaacct gcaagagagc 480 

ttgaggtcta aggagtaa 498 

<210> 6

<211> 12728 

<212> DNA 

<213> Gallus gallus 

<220> 

<221> misc_feature 

<222> (1)..(237) 

<223> 5prime matrix (scaffold) attachment region (MAR) 
<220> 

<221> misc_feature



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<222> (261) .. (1564) 

<223> 5prime matrix (scaffold) attachment region (MAR) 
<220> 

<221> misc_feature

<222> (1565)..(1912)

<223> 5prime matrix (scaffold) attachment region (MAR)



<220>

<221> misc_feature

<222> (1930)..(2012)

<223> 5prime matrix (scaffold) attachment region (MAR)



<220> 

<221> misc_feature

<222> (2013)..(2671)

<223> Intrinsically curved DNA 



<220> 

<221> misc_feature 

<222> (5848)..(5934)

<223> Transcription Enhancer 



<220> 

<221> misc_feature 

<222> (9160)..(9325)

<223> Transcription Enhancer 



<220> 

<221> misc_feature

<222> (9326)..(9626)

<223> Negative Regulatory Element 



<220> 

<221> misc_feature

<222> (9621)..(9660)

<223> Hormone Response Element 



<220> 

<221> misc_feature

<222> (9680)..(10060)

<223> Hormone Response Element 



<220> 

<221> misc_feature

<222> (10576)..(10821)

<223> Chicken CR1 Repeat Sequence 






<220> 

<221> misc_feature 

<222> (10926)..(11193)

<223> Chicken CR1 Repeat Sequence 



<220> 

<221> misc_feature 

<222> (11424)..(11938)

<223> Lysozyme Proximal Promoter and Lysozyme Signal Peptide 



<220> 

<221> misc_feature
<222> (11946)..(12443)

<223> Human Interferon alpha 2d encoding region codon optimized for expression in chicken cells (IFNMAGMAX)



<220> 

<221> polyA_signal 
<222> (12444)..(12728)
<223> 

<400> 6 

tgccgccttc tttgatattc actctgttgt atttcatctc ttcttgccga tgaaaggata 60 

taacagtctg tataacagtc tgtgaggaaa tacttggtat ttcttctgat cagtgttttt 120 

ataagtaatg ttgaatattg gataaggctg tgtgtccttt gtcttgggag acaaagccca 180 

cagcaggtgg tggttggggt ggtggcagct cagtgacagg agaggttttt ttgcctgttt 240

tttttttttt tttttttttt aagtaaggtg ttcttttttc ttagtaaatt ttctactgga 300 

ctgtatgttt tgacaggtca gaaacatttc ttcaaaagaa gaaccttttg gaaactgtac 360 

agcccttttc tttcattccc tttttgcttt ctgtgccaat gcctttggtt ctgattgcat 420 

tatggaaaac gttgatcgga acttgaggtt tttatttata gtgtggcttg aaagcttgga 480

tagctgttgt tacacgagat accttattaa gtttaggcca gcttgatgct ttattttttc 540

cctttgaagt agtgagcgtt ctctggtttt tttcctttga aactggtgag gcttagattt 600 

ttctaatggg attttttacc tgatgatcta gttgcatacc caaatgcttg taaatgtttt 660

cctagttaac atgttgataa cttcggattt acatgttgta tatacttgtc atctgtgttt 720 

ctagtaaaaa tatatggcat ttatagaaat acgtaattcc tgatttcctt tttttttatc 780 

tctatgctct gtgtgtacag gtcaaacaga cttcactcct atttttattt atagaatttt 840 

atatgcagtc tgtcgttggt tcttgtgttg taaggataca gccttaaatt tcctagagcg 900 

atgctcagta aggcgggttg tcacatgggt tcaaatgtaa aacgggcacg tttggctgct 960 

gccttcccga gatccaggac actaaactgc ttctgcactg aggtataaat cgcttcagat 1020 
cccagggaag tgcagatcca cgtgcatatt cttaaagaag aatgaatact ttctaaaata 1080

ttttggcata ggaagcaagc tgcatggatt tgtttgggac ttaaattatt ttggtaacgg 1140 

agtgcatagg ttttaaacac agttgcagca tgctaacgag tcacagcgtt tatgcagaag 1200 

tgatgcctgg atgcctgttg cagctgttta cggcactgcc ttgcagtgag cattgcagat 1260 

aggggtgggg tgctttgtgt cgtgttccca cacgctgcca cacagccacc tcccggaaca 1320 

catctcacct gctgggtact tttcaaacca tcttagcagt agtagatgag ttactatgaa 1380 

acagagaagt tcctcagttg gatattctca tgggatgtct tttttcccat gttgggcaaa 1440 

gtatgataaa gcatctctat ttgtaaatta tgcacttgtt agttcctgaa tcctttctat 1500 

agcaccactt attgcagcag gtgtaggctc tggtgtggcc tgtgtctgtg cttcaatctt 1560 

ttaaagcttc tttggaaata cactgacttg attgaagtct cttgaagata gtaaacagta 1620 

cttacctttg atcccaatga aatcgagcat ttcagttgta aaagaattcc gcctattcat 1680 

accatgtaat gtaattttac acccccagtg ctgacacttt ggaatatatt caagtaatag 1740 

actttggcct caccctcttg tgtactgtat tttgtaatag aaaatatttt aaactgtgca 1800 

tatgattatt acattatgaa agagacattc tgctgatctt caaatgtaag aaaatgagga 1860

gtgcgtgtgc ttttataaat acaagtgatt gcaaattagt gcaggtgtcc ttaaaaaaaa 1920 

aaaaaaaaag taatataaaa aggaccaggt gttttacaag tgaaatacat tcctatttgg 1980 

taaacagtta catttttatg aagattacca gcgctgctga ctttctaaac ataaggctgt 2040 

attgtcttcc tgtaccattg catttcctca ttcccaattt gcacaaggat gtctgggtaa 2100 

actattcaag aaatggcttt gaaatacagc atgggagctt gtctgagttg gaatgcagag 2160 

ttgcactgca aaatgtcagg aaatggatgt ctctcagaat gcccaactcc aaaggatttt 2220 

atatgtgtat atagtaagca gtttcctgat tccagcaggc caaagagtct gctgaatgtt 2280 

gtgttgccgg agacctgtat ttctcaacaa ggtaagatgg tatcctagca actgcggatt 2340 

ttaatacatt ttcagcagaa gtacttagtt aatctctacc tttagggatc gtttcatcat 2400 

ttttagatgt tatacttgaa atactgcata acttttagct ttcatgggtt cctttttttc 2460

agcctttagg agactgttaa gcaatttgct gtccaacttt tgtgttggtc ttaaactgca 2520 

atagtagttt accttgtatt gaagaaataa agaccatttt tatattaaaa aatacttttg 2580 

tctgtcttca ttttgacttg tctgatatcc ttgcagtgcc cattatgtca gttctgtcag 2640 

atattcagac atcaaaactt aacgtgagct cagtggagtt acagctgcgg ttttgatgct 2700 

gttattattt ctgaaactag aaatgatgtt gtcttcatct gctcatcaaa cacttcatgc 2760 
agagtgtaag gctagtgaga aatgcataca tttattgata cttttttaaa gtcaactttt 2820

tatcagattt ttttttcatt tggaaatata ttgttttcta gactgcatag cttctgaatc 2880

tgaaatgcag tctgattggc atgaagaagc acagcactct tcatcttact taaacttcat 2940

tttggaatga aggaagttaa gcaagggcac aggtccatga aatagagaca gtgcgctcag 3000

gagaaagtga acctggattt ctttggctag tgttctaaat ctgtagtgag gaaagtaaca 3060

cccgattcct tgaaagggct ccagctttaa tgcttccaaa ttgaaggtgg caggcaactt 3120

ggccactggt tatttactgc attatgtctc agtttcgcag ctaacctggc ttctccacta 3180

ttgagcatgg actatagcct ggcttcagag gccaggtgaa ggttgggatg ggtggaagga 3240

gtgctgggct gtggctgggg ggactgtggg gactccaagc tgagcttggg gtgggcagca 3300

cagggaaaag tgtgggtaac tatttttaag tactgtgttg caaacgtctc atctgcaaat 3360

acgtagggtg tgtactctcg aagattaaca gtgtgggttc agtaatatat ggatgaattc 3420

acagtggaag cattcaaggg tagatcatct aacgacacca gatcatcaag ctatgattgg 3480

aagcggtatc agaagagcga ggaaggtaag cagtcttcat atgttttccc tccacgtaaa 3540

gcagtctggg aaagtagcac cccttgagca gagacaagga aataattcag gagcatgtgc 3600

taggagaact ttcttgctga attctacttg caagagcttt gatgcctggc ttctggtgcc 3660

ttctgcagca cctgcaaggc ccagagcctg tggtgagctg gagggaaaga ttctgctcaa 3720

gtccaagctt cagcaggtca ttgtctttgc ttcttccccc agcactgtgc agcagagtgg 3780

aactgatgtc gaagcctcct gtccactacc tgttgctgca ggcagactgc tctcagaaaa 3840

agagagctaa ctctatgcca tagtctgaag gtaaaatggg ttttaaaaaa gaaaacacaa 3900

aggcaaaacc ggctgcccca tgagaagaaa gcagtggtaa acatggtaga aaaggtgcag 3960

aagcccccag gcagtgtgac aggcccctcc tgccacctag aggcgggaac aagcttccct 4020

gcctagggct ctgcccgcga agtgcgtgtt tctttggtgg gttttgtttg gcgtttggtt 4080

ttgagattta gacacaaggg aagcctgaaa ggaggtgttg ggcactattt tggtttgtaa 4140

agcctgtact tcaaatatat attttgtgag ggagtgtagc gaattggcca atttaaaata 4200


aagttgcaag agattgaagg ctgagtagtt gagagggtaa cacgtttaat gagatcttct 4260


gaaactactg cttctaaaca cttgtttgag tggtgagacc ttggataggt gagtgctctt 4320

gttacatgtc tgatgcactt gcttgtcctt ttccatccac atccatgcat tccacatcca 4380

cgcatttgtc acttatccca tatctgtcat atctgacata cctgtctctt cgtcacttgg 4440

tcagaagaaa cagatgtgat aatccccagc cgccccaagt ttgagaagat ggcagttgct 4500
4560


tcttcctgcc ttacattctg ggcattattt caaatatctt tggagtgcgc tgctctcaag 4620

tttgtgtctt cctactctta gagtgaatgc tcttagagtg aaagagaagg aagagaagat 4680

gttggccgca gttctctgat gaacacacct ctgaataatg gccaaaggtg ggtgggtttc 4740

tctgaggaac gggcagcgtt tgcctctgaa agcaaggagc tctgcggagt tgcagttatt 4800

ttgcaactga tggtggaact ggtgcttaaa gcagattccc taggttccct gctacttctt 4860

ttccttcttg gcagtcagtt tatttctgac agacaaacag ccacccccac tgcaggctta 4920

gaaagtatgt ggctctgcct gggtgtgtta cagctctgcc ctggtgaaag gggattaaaa 4980

cgggcaccat tcatcccaaa caggatcctc attcatggat caagctgtaa ggaacttggg 5040

ctccaacctc aaaacattaa ttggagtacg aatgtaatta aaactgcatt ctcgcattcc 5100

taagtcattt agtctggact ctgcagcatg taggtcggca gctcccactt tctcaaagac 5160

cactgatgga ggagtagtaa aaatggagac cgattcagaa caaccaacgg agtgttgccg 5220

aagaaactga tggaaataat gcatgaattg tgtggtggac atttttttta aatacataaa 5280

ctacttcaaa tgaggtcgga gaaggtcagt gttttattag cagccataaa accaggtgag 5340

cgagtaccat ttttctctac aagaaaaacg attctgagct ctgcgtaagt ataagttctc 5400

catagcggct gaagctcccc cctggctgcc tgccatctca gctggagtgc agtgccattt 5460

ccttggggtt tctctcacag cagtaatggg acaatacttc acaaaaattc tttcttttcc 5520

tgtcatgtgg gatccctact gtgccctcct ggttttacgt taccccctga ctgttccatt 5580

cagcggtttg gaaagagaaa aagaatttgg aaataaaaca tgtctacgtt atcacctcct 5640

ccagcatttt ggtttttaat tatgtcaata actggcttag atttggaaat gagagggggt 5700

tgggtgtatt accgaggaac aaaggaaggc ttatataaac tcaagtcttt tatttagaga 5760

actggcaagc tgtcaaaaac aaaaaggcct taccaccaaa ttaagtgaat agccgctata 5820

gccagcaggg ccagcacgag ggatggtgca ctgctggcac tatgccacgg cctgcttgtg 5880

actctgagag caactgcttt ggaaatgaca gcacttggtg caatttcctt tgtttcagaa 5940

tgcgtagagc gtgtgcttgg cgacagtttt tctagttagg ccacttcttt tttccttctc 6000

tcctcattct cctaagcatg tctccatgct ggtaatccca gtcaagtgaa cgttcaaaca 6060

atgaatccat cactgtagga ttctcgtggt gatcaaatct ttgtgtgagg tctataaaat 6120

atggaagctt atttattttt cgttcttcca tatcagtctt ctctatgaca attcacatcc 6180

accacagcaa attaaaggtg aaggaggctg gtgggatgaa gagggtcttc tagctttacg 6240
ttcttccttg caaggccaca ggaaaatgct gagagctgta gaatacagcc tggggtaaga 6300

agttcagtct cctgctggga cagctaaccg catcttataa ccccttctga gactcatctt 6360

aggaccaaat agggtctatc tggggttttt gttcctgctg ttcctcctgg aaggctatct 6420

cactatttca ctgctcccac ggttacaaac caaagataca gcctgaattt tttctaggcc 6480

acattacata aatttgacct ggtaccaata ttgttctcta tatagttatt tccttcccca 6540

ctgtgtttaa ccccttaagg cattcagaac aactagaatc atagaatggt ttggattgga 6600

aggggcctta aacatcatcc atttccaacc ctctgccatg ggctgcttgc cacccactgg 6660

ctcaggctgc ccagggcccc atccagcctg gccttgagca cctccaggga tggggcaccc 6720

acagcttctc tgggcagcct gtgccaacac ctcaccactc tctgggtaaa gaattctctt 6780

ttaacatcta atctaaatct cttctctttt agtttaaagc cattcctctt tttcccgttg 6840

ctatctgtcc aagaaatgtg tattggtctc cctcctgctt ataagcagga agtactggaa 6900

ggctgcagtg aggtctcccc acagccttct cttctccagg ctgaacaagc ccagctcctt 6960

cagcctgtct tcgtaggaga tcatcttagt ggccctcctc tggacccatt ccaacagttc 7020

cacggctttc ttgtggagcc ccaggtctgg atgcagtact tcagatgggg ccttacaaag 7080

gcagagcaga tggggacaat cgcttacccc tccctgctgg ctgcccctgt tttgatgcag 7140

cccagggtac tgttggcctt tcaggctccc agaccccttg ctgatttgtg tcaagctttt 7200

catccaccag aacccacgct tcctggttaa tacttctgcc ctcacttctg taagcttgtt 7260

tcaggagact tccattcttt aggacagact gtgttacacc tacctgccct attcttgcat 7320

atatacattt cagttcatgt ttcctgtaac aggacagaat atgtattcct ctaacaaaaa 7380

tacatgcaga attcctagtg ccatctcagt agggttttca tggcagtatt agcacatagt 7440

caatttgctg caagtacctt ccaagctgcg gcctcccata aatcctgtat ttgggatcag 7500

ttaccttttg gggtaagctt ttgtatctgc agagaccctg ggggttctga tgtgcttcag 7560

ctctgctctg ttctgactgc accattttct agatcaccca gttgttcctg tacaacttcc 7620

ttgtcctcca tcctttccca gcttgtatct ttgacaaata caggcctatt tttgtgtttg 7680

cttcagcagc catttaattc ttcagtgtca tcttgttctg ttgatgccac tggaacagga 7740

ttttcagcag tcttgcaaag aacatctagc tgaaaacttt ctgccattca atattcttac 7800

cagttcttct tgtttgaggt gagccataaa ttactagaac ttcgtcactg acaagtttat 7860

gcattttatt acttctatta tgtacttact ttgacataac acagacacgc acatattttg 7920

ctgggatttc cacagtgtct ctgtgtcctt cacatggttt tactgtcata cttccgttat 7980
aaccttggca atctgcccag ctgcccatca caagaaaaga gattcctttt ttattacttc 8040


tcttcagcca ataaacaaaa tgtgagaagc ccaaacaaga acttgtgggg caggctgcca 8100

tcaagggaga gacagctgaa gggttgtgta gctcaataga attaagaaat aataaagctg 8160

tgtcagacag ttttgcctga tttatacagg cacgccccaa gccagagagg ctgtctgcca 8220

aggccacctt gcagtccttg gtttgtaaga taagtcatag gtaacttttc tggtgaattg 8280

cgtggagaat catgatggca gttcttgctg tttactatgg taagatgcta aaataggaga 8340

cagcaaagta acacttgctg ctgtaggtgc tctgctatcc agacagcgat ggcactcgca 8400

caccaagatg agggatgctc ccagctgacg gatgctgggg cagtaacagt gggtcccatg 8460

ctgcctgctc attagcatca cctcagccct caccagccca tcagaaggat catcccaagc 8520

tgaggaaagt tgctcatctt cttcacatca tcaaaccttt ggcctgactg atgcctcccg 8580

gatgcttaaa tgtggtcact gacatcttta tttttctatg atttcaagtc agaacctccg 8640

gatcaggagg gaacacatag tgggaatgta ccctcagctc caaggccaga tcttccttca 8700

atgatcatgc atgctactta ggaaggtgtg tgtgtgtgaa tgtagaattg cctttgttat 8760

tttttcttcc tgctgtcagg aacattttga ataccagaga aaaagaaaag tgctcttctt 8820

ggcatgggag gagttgtcac acttgcaaaa taaaggatgc agtcccaaat gttcataatc 8880

tcagggtctg aaggaggatc agaaactgtg tatacaattt caggcttctc tgaatgcagc 8940

ttttgaaagc tgttcctggc cgaggcagta ctagtcagaa ccctcggaaa caggaacaaa 9000

tgtcttcaag gtgcagcagg aggaaacacc ttgcccatca tgaaagtgaa taaccactgc 9060

cgctgaagga atccagctcc tgtttgagca ggtgctgcac actcccacac tgaaacaaca 9120

gttcattttt ataggacttc caggaaggat cttcttctta agcttcttaa ttatggtaca 9180

tctccagttg gcagatgact atgactactg acaggagaat gaggaactag ctgggaatat 9240

ttctgtttga ccaccatgga gtcacccatt tctttactgg tatttggaaa taataattct 9300

gaattgcaaa gcaggagtta gcgaagatct tcatttcttc catgttggtg acagcacagt 9360

tctggctatg aaagtctgct tacaaggaag aggataaaaa tcatagggat aataaatcta 9420

agtttgaaga caatgaggtt ttagctgcat ttgacatgaa gaaattgaga cctctactgg 9480

atagctatgg tatttacgtg tctttttgct tagttactta ttgaccccag ctgaggtcaa 9540

gtatgaactc aggtctctcg ggctactggc atggattgat tacatacaac tgtaatttta 9600

gcagtgattt agggtttatg agtacttttg cagtaaatca tagggttagt aatgttaatc 9660

tcagggaaaa aaaaaaaaag ccaaccctga cagacatccc agctcaggtg gaaatcaagg 9720
agtgugy t^c uayaya5C5C agggactctt ctcttaggac ctttatgtac 9780


agggcctcaa gataactgat gttagtcaga agactttcca ttctggccac agttcagctg 9840

aggcaatcct ggaattttct ctccgctgca cagttccagt catcccagtt tgtacagttc 9900

tggcactttt tgggtcaggc cgtgatccaa ggagcagaag ttccagctat ggtcagggag 9960

tgcctgaccg tcccaactca ctgcactcaa acaaaggcga aaccacaaga gtggcttttg 10020

ttgaaattgc agtgtggccc agaggggctg caccagtact ggattgacca cgaggcaaca 10080

ttaatcctca gcaagtgcaa tttgcagcca ttaaattgaa ctaactgata ctacaatgca 10140

atcagtatca acaagtggtt tggcttggaa gatggagtct aggggctcta caggagtagc 10200

tactctctaa tggagttgca ttttgaagca ggacactgtg aaaagctggc ctcctaaaga 10260

ggctgctaaa cattagggtc aattttccag tgcactttct gaagtgtctg cagttcccca 10320

tgcaaagctg cccaaacata gcacttccaa ttgaatacaa ttatatgcag gcgtactgct 10380

tcttgccagc actgtccttc tcaaatgaac tcaacaaaca atttcaaagt ctagtagaaa 10440

gtaacaagct ttgaatgtca ttaaaaagta tatctgcttt cagtagttca gcttatttat 10500

gcccactaga aacatcttgt acaagctgaa cactggggct ccagattagt ggtaaaacct 10560

actttataca atcatagaat catagaatgg cctgggttgg aagggacccc aaggatcatg 10620

aagatccaac acccccgcca caggcagggc caccaacctc cagatctggt actagaccag 10680

gcagcccagg gctccatcca acctggccat gaacacctcc agggatggag catccacaac 10740

ctctctgggc agcctgtgcc agcacctcac caccctctct gtgaagaact tttccctgac 10800

atccaatcta agccttccct ccttgaggtt agatccactc ccccttgtgc tatcactgtc 10860

tactcttgta aaaagttgat tctcctcctt tttggaaggt tgcaatgagg tctccttgca 10920

gccttcttct cttctgcagg atgaacaagc ccagctccct cagcctgtct ttataggaga 10980

ggtgctccag ccctctgatc atctttgtgg ccctcctctg gacccgctcc aagagctcca 11040

catctttcct gtactggggg ccccaggcct gaatgcagta ctccagatgg ggcctcaaaa 11100

gagcagagta aagagggaca atcaccttcc tcaccctgct ggccagccct cttctgatgg 11160

agccctggat acaactggct ttctgagctg caacttctcc ttatcagttc cactattaaa 11220

acaggaacaa tacaacaggt gctgatggcc agtgcagagt ttttcacact tcttcatttc 11280

ggtagatctt agatgaggaa cgttgaagtt gtgcttctgc gtgtgcttct tcctcctcaa 11340

atactcctgc ctgatacctc accccacctg ccactgaatg gctccatggc cccctgcagc 11400

cagggccctg atgaacccgg cactgcttca gatgctgttt aatagcacag tatgaccaag 11460
ttgcacctat gaatacacaa acaatgtgtt gcatccttca 11520


aatttgcatt gtcaggaaat ggtttagtaa ttctgccaat taaaacttgt ttatctacca 11580

tggctgtttt tatggctgtt agtagtggta cactgatgat gaacaatggc tatgcagtaa 11640

aatcaagact gtagatattg caacagacta taaaattcct ctgtggctta gccaatgtgg 11700

tacttcccac attgtataag aaatttggca agtttagagc aatgtttgaa gtgttgggaa 11760

atttctgtat actcaagagg gcgtttttga caactgtaga acagaggaat caaaaggggg 11820

tgggaggaag ttaaaagaag aggcaggtgc aagagagctt gcagtcccgc tgtgtgtacg 11880

acactggcaa catgaggtct ttgctaatct tggtgctttg cttcctgccc ctggctgcct 11940

tagggtgcga tctgcctcag acccacagcc tgggcagcag gaggaccctg atgctgctgg 12000

ctcagatgag gagaatcagc ctgtttagct gcctgaagga taggcacgat tttggctttc 12060

ctcaagagga gtttggcaac cagtttcaga aggctgagac catccctgtg ctgcacgaga 12120

tgatccagca gatctttaac ctgtttagca ccaaggatag cagcgctgct tgggatgaga 12180

ccctgctgga taagttttac accgagctgt accagcagct gaacgatctg gaggcttgcg 12240

tgatccaggg cgtgggcgtg accgagaccc ctctgatgaa ggaggatagc atcctggctg 12300

tgaggaagta ctttcagagg atcaccctgt acctgaagga gaagaagtac agcccctgcg 12360

cttgggaagt cgtgagggct gagatcatga ggagctttag cctgagcacc aacctgcaag 12420

agagcttgag gtctaaggag taaaaagtct agagtcgggg cggccggccg cttcgagcag 12480

acatgataag atacattgat gagtttggac aaaccacaac tagaatgcag tgaaaaaaat 12540

gctttatttg tgaaatttgt gatgctattg ctttatttgt aaccattata agctgcaata 12600

aacaagttaa caacaacaat tgcattcatt ttatgtttca ggttcagggg gaggtgtggg 12660

aggtttttta aagcaagtaa aacctctaca aatgtggtaa aatcgataag gatccgtcga 12720

gcggccgc 12728



<210> 7 

<211> 11945 

<212> DNA 

<213> Gallus gallus 

<220> 

<221> misc_feature 

<222> (1)..(237) 

<223> 5prime matrix attachment region (MAR)
<220>

<221> misc_feature
<222> (261)..(1564)

<223> 5prime matrix attachment region (MAR)



<220> 

<221> misc_feature

<222> (1565)..(1912)

<223> 5prime matrix attachment region (MAR)



<220> 

<221> misc_feature 

<222> (1930)..(2012)

<223> 5prime matrix attachment region (MAR) 



<220> 

<221> misc_feature 

<222> (2013)..(2671)

<223> Intrinsically Curved DNA 



<220> 

<221> misc_feature

<222> (5848)..(5934)

<223> Transcription Enhancer 



<220> 

<221> misc_feature 

<222> (9160)..(9325)

<223> Transcription Enhancer 



<220> 

<221> misc_feature 

<222> (9326)..(9626)

<223> Negative Regulatory Element 



<220> 

<221> misc_feature 

<222> (9621)..(9660)

<223> Hormone Response Element 



<220> 

<221> misc_feature

<222> (9680)..(10060)

<223> Hormone Response Element 



<220> 

<221> misc_feature

<222> (10576)..(10821)

<223> Chicken CR1 Repeat
<220>

<221> misc_feature

<222> (10926)..(11193)

<223> Chicken CR1 Repeat

<220> 

<221> misc_feature 

<222> (11424)..(11938)

<223> Proximal promoter and lysozyme signal peptide 



<400> 7 
tgccgccttc tttgatattc actctgttgt atttcatctc ttcttgccga tgaaaggata 60

taacagtctg tataacagtc tgtgaggaaa tacttggtat ttcttctgat cagtgttttt 120

ataagtaatg ttgaatattg gataaggctg tgtgtccttt gtcttgggag acaaagccca 180

cagcaggtgg tggttggggt ggtggcagct cagtgacagg agaggttttt ttgcctgttt 240

tttttttttt tttttttttt aagtaaggtg ttcttttttc ttagtaaatt ttctactgga 300

ctgtatgttt tgacaggtca gaaacatttc ttcaaaagaa gaaccttttg gaaactgtac 360

agcccttttc tttcattccc tttttgcttt ctgtgccaat gcctttggtt ctgattgcat 420

tatggaaaac gttgatcgga acttgaggtt tttatttata gtgtggcttg aaagcttgga 480

tagctgttgt tacacgagat accttattaa gtttaggcca gcttgatgct ttattttttc 540

cctttgaagt agtgagcgtt ctctggtttt tttcctttga aactggtgag gcttagattt 600

ttctaatggg attttttacc tgatgatcta gttgcatacc caaatgcttg taaatgtttt 660

cctagttaac atgttgataa cttcggattt acatgttgta tatacttgtc atctgtgttt 720

ctagtaaaaa tatatggcat ttatagaaat acgtaattcc tgatttcctt 780

tctatgctct gtgtgtacag gtcaaacaga cttcactcct atttttattt atagaatttt 840

atatgcagtc tgtcgttggt tcttgtgttg taaggataca gccttaaatt tcctagagcg 900

atgctcagta aggcgggttg tcacatgggt tcaaatgtaa aacgggcacg tttggctgct 960

gccttcccga gatccaggac actaaactgc ttctgcactg aggtataaat cgcttcagat 1020

cccagggaag tgcagatcca cgtgcatatt cttaaagaag aatgaatact ttctaaaata 1080

ttttggcata ggaagcaagc tgcatggatt tgtttgggac ttaaattatt ttggtaacgg 1140

agtgcatagg ttttaaacac agttgcagca tgctaacgag tcacagcgtt tatgcagaag 1200

tgatgcctgg atgcctgttg cagctgttta cggcactgcc ttgcagtgag cattgcagat 1260

aggggtgggg tgctttgtgt cgtgttccca cacgctgcca cacagccacc tcccggaaca 1320

catctcacct gctgggtact tttcaaacca tcttagcagt agtagatgag ttactatgaa 1380
acagagaagt tcctcagttg gatattctca tgggatgtct tttttcccat gttgggcaaa 1440


gtatgataaa gcatctctat ttgtaaatta tgcacttgtt agttcctgaa tcctttctat 1500

agcaccactt attgcagcag gtgtaggctc tggtgtggcc tgtgtctgtg cttcaatctt 1560

ttaaagcttc tttggaaata cactgacttg attgaagtct cttgaagata gtaaacagta 1620

cttacctttg atcccaatga aatcgagcat ttcagttgta aaagaattcc gcctattcat 1680

accatgtaat gtaattttac acccccagtg ctgacacttt ggaatatatt caagtaatag 1740

actttggcct caccctcttg tgtactgtat tttgtaatag aaaatatttt aaactgtgca 1800

tatgattatt acattatgaa agagacattc tgctgatctt caaatgtaag aaaatgagga 1860

gtgcgtgtgc ttttataaat acaagtgatt gcaaattagt gcaggtgtcc ttaaaaaaaa 1920

aaaaaaaaag taatataaaa aggaccaggt gttttacaag tgaaatacat tcctatttgg 1980

taaacagtta catttttatg aagattacca gcgctgctga ctttctaaac ataaggctgt 2040

attgtcttcc tgtaccattg catttcctca ttcccaattt gcacaaggat gtctgggtaa 2100

actattcaag aaatggcttt gaaatacagc atgggagctt gtctgagttg gaatgcagag 2160

ttgcactgca aaatgtcagg aaatggatgt ctctcagaat gcccaactcc aaaggatttt 2220

atatgtgtat atagtaagca gtttcctgat tccagcaggc caaagagtct gctgaatgtt 2280

gtgttgccgg agacctgtat ttctcaacaa ggtaagatgg tatcctagca actgcggatt 2340

ttaatacatt ttcagcagaa gtacttagtt aatctctacc tttagggatc gtttcatcat 2400

ttttagatgt tatacttgaa atactgcata acttttagct ttcatgggtt cctttttttc 2460

agcctttagg agactgttaa gcaatttgct gtccaacttt tgtgttggtc ttaaactgca 2520

atagtagttt accttgtatt gaagaaataa agaccatttt tatattaaaa aatacttttg 2580

tctgtcttca ttttgacttg tctgatatcc ttgcagtgcc cattatgtca gttctgtcag 2640

atattcagac atcaaaactt aacgtgagct cagtggagtt acagctgcgg ttttgatgct 2700

gttattattt ctgaaactag aaatgatgtt gtcttcatct gctcatcaaa cacttcatgc 2760

agagtgtaag gctagtgaga aatgcataca tttattgata cttttttaaa gtcaactttt 2820

tatcagattt ttttttcatt tggaaatata ttgttttcta gactgcatag cttctgaatc 2880

tgaaatgcag tctgattggc atgaagaagc acagcactct tcatcttact taaacttcat 2940

tttggaatga aggaagttaa gcaagggcac aggtccatga aatagagaca gtgcgctcag 3000

gagaaagtga acctggattt ctttggctag tgttctaaat ctgtagtgag gaaagtaaca 3060

cccgattcct tgaaagggct ccagctttaa tgcttccaaa ttgaaggtgg caggcaactt 3120
ggccactggt tatttactgc attatgtctc agtttcgcag ctaacctggc ttctccacta 3180

ttgagcatgg actatagcct ggcttcagag gccaggtgaa ggttgggatg ggtggaagga 3240

gtgctgggct gtggctgggg ggactgtggg gactccaagc tgagcttggg gtgggcagca 3300

cagggaaaag tgtgggtaac tatttttaag tactgtgttg caaacgtctc atctgcaaat 3360

acgtagggtg tgtactctcg aagattaaca gtgtgggttc agtaatatat ggatgaattc 3420

acagtggaag cattcaaggg tagatcatct aacgacacca gatcatcaag ctatgattgg 3480

aagcggtatc agaagagcga ggaaggtaag cagtcttcat atgttttccc tccacgtaaa 3540

gcagtctggg aaagtagcac cccttgagca gagacaagga aataattcag gagcatgtgc 3600

taggagaact ttcttgctga attctacttg caagagcttt gatgcctggc ttctggtgcc 3660

ttctgcagca cctgcaaggc ccagagcctg tggtgagctg gagggaaaga ttctgctcaa 3720

gtccaagctt cagcaggtca ttgtctttgc ttcttccccc agcactgtgc agcagagtgg 3780

aactgatgtc gaagcctcct gtccactacc tgttgctgca ggcagactgc tctcagaaaa 3840

agagagctaa ctctatgcca tagtctgaag gtaaaatggg ttttaaaaaa gaaaacacaa 3900

aggcaaaacc ggctgcccca tgagaagaaa gcagtggtaa acatggtaga aaaggtgcag 3960

aagcccccag gcagtgtgac aggcccctcc tgccacctag aggcgggaac aagcttccct 4020

gcctagggct ctgcccgcga agtgcgtgtt tctttggtgg gttttgtttg gcgtttggtt 4080

ttgagattta gacacaaggg aagcctgaaa ggaggtgttg ggcactattt tggtttgtaa 4140

agcctgtact tcaaatatat attttgtgag ggagtgtagc gaattggcca atttaaaata 4200

aagttgcaag agattgaagg ctgagtagtt gagagggtaa cacgtttaat gagatcttct 4260

gaaactactg cttctaaaca cttgtttgag tggtgagacc ttggataggt gagtgctctt 4320

gttacatgtc tgatgcactt gcttgtcctt ttccatccac atccatgcat tccacatcca 4380

cgcatttgtc acttatccca tatctgtcat atctgacata cctgtctctt cgtcacttgg 4440

tcagaagaaa cagatgtgat aatccccagc cgccccaagt ttgagaagat ggcagttgct 4500

tctttccctt tttcctgcta agtaaggatt ttctcctggc tttgacacct cacgaaatag 4560

tcttcctgcc ttacattctg ggcattattt caaatatctt tggagtgcgc tgctctcaag 4620

tttgtgtctt cctactctta gagtgaatgc tcttagagtg aaagagaagg aagagaagat 4680

gttggccgca gttctctgat gaacacacct ctgaataatg gccaaaggtg ggtgggtttc 4740

tctgaggaac gggcagcgtt tgcctctgaa agcaaggagc tctgcggagt tgcagttatt 4800

ttgcaactga tggtggaact ggtgcttaaa gcagattccc taggttccct gctacttctt 4860
ttccttcttg gcagtcagtt tatttctgac agacaaacag ccacccccac tgcaggctta 4920

gaaagtatgt ggctctgcct gggtgtgtta cagctctgcc ctggtgaaag gggattaaaa 4980

cgggcaccat tcatcccaaa caggatcctc attcatggat caagctgtaa ggaacttggg 5040 
ctccaacctc aaaacattaa ttggagtacg aatgtaatta aaactgcatt ctcgcattcc 5100

taagtcattt agtctggact ctgcagcatg taggtcggca gctcccactt tctcaaagac 5160 

cactgatgga ggagtagtaa aaatggagac cgattcagaa caaccaacgg agtgttgccg 5220 

aagaaactga tggaaataat gcatgaattg tgtggtggac atttttttta aatacataaa 5280 

ctacttcaaa tgaggtcgga gaaggtcagt gttttattag cagccataaa accaggtgag 5340 

cgagtaccat ttttctctac aagaaaaacg attctgagct ctgcgtaagt ataagttctc 5400 

catagcggct gaagctcccc cctggctgcc tgccatctca gctggagtgc agtgccattt 5460

ccttggggtt tctctcacag cagtaatggg acaatacttc acaaaaattc tttcttttcc 5520 

tgtcatgtgg gatccctact gtgccctcct ggttttacgt taccccctga ctgttccatt 5580 

cagcggtttg gaaagagaaa aagaatttgg aaataaaaca tgtctacgtt atcacctcct 5640 

ccagcatttt ggtttttaat tatgtcaata actggcttag atttggaaat gagagggggt 5700 

tgggtgtatt accgaggaac aaaggaaggc ttatataaac tcaagtcttt tatttagaga 5760 

actggcaagc tgtcaaaaac aaaaaggcct taccaccaaa ttaagtgaat agccgctata 5820 

gccagcaggg ccagcacgag ggatggtgca ctgctggcac tatgccacgg cctgcttgtg 5880 

actctgagag caactgcttt ggaaatgaca gcacttggtg caatttcctt tgtttcagaa 5940 

tgcgtagagc gtgtgcttgg cgacagtttt tctagttagg ccacttcttt tttccttctc 6000 

tcctcattct cctaagcatg tctccatgct ggtaatccca gtcaagtgaa cgttcaaaca 6060 

atgaatccat cactgtagga ttctcgtggt gatcaaatct ttgtgtgagg tctataaaat 6120 

atggaagctt atttattttt cgttcttcca tatcagtctt ctctatgaca attcacatcc 6180 

accacagcaa attaaaggtg aaggaggctg gtgggatgaa gagggtcttc tagctttacg 6240 

ttcttccttg caaggccaca ggaaaatgct gagagctgta gaatacagcc tggggtaaga 6300 

agttcagtct cctgctggga cagctaaccg catcttataa ccccttctga gactcatctt 6360 

aggaccaaat agggtctatc tggggttttt gttcctgctg ttcctcctgg aaggctatct 6420 

cactatttca ctgctcccac ggttacaaac caaagataca gcctgaattt tttctaggcc 6480 

acattacata aatttgacct ggtaccaata ttgttctcta tatagttatt tccttcccca 6540 

ctgtgtttaa ccccttaagg cattcagaac aactagaatc atagaatggt ttggattgga 6600 
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aggggcctta 


aacatcatcc 


atttccaacc ctctgccatg 


ggctgcttgc 


cacccaciigg 


f f r r\ 

ooou 


ctcaggctgc 


ccagggcccc 


atccagcctg gccttgagca 


cctccaggga 


tggggcaccc 


6720 


acagcttctc 


tgggcagcct 


gtgccaacac ctcaccactc 


tctgggtaaa gaattctctt 


6780 


ttaacatcta 


atctaaatct 


cttctctttt agtttaaagc 


cattcctctt 


tttcccgttg 


6840 


ctatctgtcc 


aagaaatgtg 


tattggtctc cctcctgctt 


ataagcagga 


agtactggaa 


6900 


ggctgcagtg 


aggtctcccc 


acagccttct cttctccagg 


ctgaacaagc 


ccagctcctt 


6960 


cagcctgtct 


tcgtaggaga 


tcatcttagt ggccctcctc 


tggacccatt ccaacagttc 


7020 


cacggctttc 


ttgtggagcc 


ccaggtctgg atgcagtact 


tcagatgggg 


ccttacaaag 


7080 


gcagagcaga 


tggggacaat 


cgcttacccc tccctgctgg 


ctgcccctgt 


tttgatgcag 


7140 


cccagggtac 


tgttggcctt 


tcaggctccc agaccccttg 


ctgatttgtg 


tcaagctttt 


7200 


catccaccag 


aacccacgct 


tcctggttaa tacttctgcc 


ctcacttctg 


taagcttgtt 


7260 


tcaggagact 


tccattcttt 


aggacagact gtgttacacc 


tacctgccct 


attcttgcat 


7320 


atatacattt 


cagttcatgt 


ttcctgtaac aggacagaat 


atgtattcct 


ctaacaaaaa 


7380 


tacatgcaga 


attcctagtg 


ccatctcagt agggttttca 


tggcagtatt 


agcacatagt 


7440 


caatttgctg 


caagtacctt 


ccaagctgcg gcctcccata 


aatcctgtat 


ttgggatcag 


7500 


ttaccttttg 


gggtaagctt 


ttgtatctgc agagaccctg 


ggggttctga 


tgtgcttcag 


7560 


ctctgctctg 


ttctgactgc 


accattttct agatcaccca 


gttgttcctg 


tacaacttcc 


7620 


ttgtcctcca 


tcctttccca 


gcttgtatct ttgacaaata 


caggcctatt tttgtgtttg 


7680 


cttcagcagc 


catttaattc 


ttcagtgtca tcttgttctg 


ttgatgccac tggaacagga 


7740 


ttttcagcag 


tcttgcaaag 


aacatctagc tgaaaacttt 


ctgccattca 


atattcttac 


7800 


cagttcttct 


tgtttgaggt 


gagccataaa ttactagaac 


ttcgtcactg 


acaagtttat 


7860 


gcattttatt 


acttctatta 


tgtacttact ttgacataac 


acagacacgc acatattttg 


7920 


ctgggatttc 


cacagtgtct 


ctgtgtcctt cacatggttt 


tactgtcata 


cttccgttat 


7980 


aaccttggca 


atctgcccag 


ctgcccatca caagaaaaga 


gattcctttt ttattacttc . 


8040 


tcttcacicca 


ataaacaaaa 


tgtgagaagc ccaaacaaga 


acttgtgggg caggctgcca 


8100 


tcaagggaga 


gacagctgaa 


gggttgtgta gctcaataga 


attaagaaat aataaagctg 


8160 


tgtcagacag 


ttttgcctga 


tttatacagg cacgccccaa 


gccagagagg 


ctgtctgcca 


8220 


aggccacctt 


gcagtccttg 


gtttgtaaga taagtcatag 


gtaacttttc 


tggtgaattg 


8280 


cgtggagaat 


catgatggca 


gttcttgctg tttactatgg 


taagatgcta 


aaataggaga 


8340


cagcaaagta acacttgctg


ctgtaggtgc


tctgctatcc


agacagcgat


ggcactcgca


8400 


caccaagatg agggatgctc 


ccagctgacg 


gatgctgggg 


cagtaacagt gggtcccatg 


8460 


ctgcctgctc attagcatca 


cctcagccct 


caccagccca 



tcagaaggat catcccaagc 


8520 


tgaggaaagt tgctcatctt 


cttcacatca 


tcaaaccttt 


ggcctgactg atgcctcccg 


8580 


gatgcttaaa tgtggtcact 


gacatcttta 


tttttctatg 


atttcaagtc agaacctccg 


8640 


gatcaggagg gaacacatag 


tgggaatgta 


ccctcagctc 


caaggccaga tcttccttca 


8700 


atgatcatgc atgctactta 


ggaaggtgtg 


tgtgtgtgaa 


tgtagaattg cctttgttat 


8760 


tttttcttcc tgctgtcagg 


aacattttga 


ataccagaga 


aaaagaaaag tgctcttctt 


8820 


ggcatgggag gagttgtcac 


acttgcaaaa 


taaaggatgc 


agtcccaaat gttcataatc 


8880 


tcagggtctg aaggaggatc 


agaaactgtg 


tatacaattt 


caggcttctc tgaatgcagc 


8940 


ttttgaaagc tgttcctggc 


cgaggcagta 


ctagtcagaa 


ccctcggaaa caggaacaaa 


9000 


tgtcttcaag gtgcagcagg 


aggaaacacc 


ttgcccatca 


tgaaagtgaa taaccactgc 


9060 


cgctgaagga atccagctcc 


tgtttgagca 


ggtgctgcac 


actcccacac tgaaacaaca 


9120 


gttcattttt ataggacttc 


caggaaggat 


cttcttctta 


agcttcttaa ttatggtaca 


9180 


tctccagttg gcagatgact 


atgactactg 


acaggagaat 


gaggaactag ctgggaatat 


9240 


ttctgtttga ccaccatgga 


gtcacccatt 


tctttactgg 


tatttggaaa taataattct 


9300 


gaattgcaaa gcaggagtta 


gcgaagatct 


tcatttcttc 


catgttggtg acagcacagt 


9360 


tctggctatg aaagtctgct 


tacaaggaag 


aggataaaaa 


tcatagggat aataaatcta 


9420 


agtttgaaga caatgaggtt 


ttagctgcat 


ttgacatgaa 


gaaattgaga cctctactgg 



9480 


atagctatgg tatttacgtg 


tctttttgct 


tagttactta 


ttgaccccag ctgaggtcaa 


9540 


gtatgaactc aggtctctcg 


ggctactggc 


atggattgat 


tacatacaac tgtaatttta 


9600 


gcagtgattt agggtttatg 


agtacttttg 


cagtaaatca 


tagggttagt aatgttaatc 


9660 


tcagggaaaa aaaaaaaaag 


ccaaccctga 


cagacatccc 


agctcaggtg gaaatcaagg 


9720 


atcacagctc agtgcggtcc 


cagagaacac 


agggactctt 


ctcttaggac ctttatgtac 


9780 


agggcctcaa gataactgat 


gttagtcaga 


agactttcca 


ttctggccac agttcagctg 


9840 


aggcaatcct ggaattttct 


ctccgctgca 


cagttccagt 


catcccagtt tgtacagttc 


9900 


tggcactttt tgggtcaggc 


cgtgatccaa 


ggagcagaag 


ttccagctat ggtcagggag 


9960 


tgcctgaccg tcccaactca 


ctgcactcaa 


acaaaggcga 


aaccacaaga gtggcttttg 


10020 


ttgaaattgc agtgtggccc 


agaggggctg 


caccagtact 


ggattgacca cgaggcaaca 


10080


ttaatcctca


gcaagtgcaa 


tttgcagcca 


ttaaattgaa




ctaactgata ctacaatgca


10140 


atcagtatca 


acaagtggtt 


tggcttggaa gatggagtct 


aggggctcta 


caggagtagc 


10200 


tactctctaa 


tggagttgca


ttttgaagca ggacactgtg 


aaaagctggc 


ctcctaaaga 


10260 


ggctgctaaa 


cattagggtc 


aattttccag 


tgcactttct 


gaagtgtctg 


cagttcccca 


10320 


tgcaaagctg 


cccaaacata 


gcacttccaa 


ttgaatacaa 


ttatatgcag 


gcgtactgct 


10380 


tcttgccagc 


actgtccttc 


tcaaatgaac 


tcaacaaaca 


atttcaaagt 


ctagtagaaa 


10440 


gtaacaagct 


ttgaatgtca 


ttaaaaagta 


tatctgcttt 


cagtagttca 


gcttatttat 


10500 


gcccactaga 


aacatcttgt 


acaagctgaa 


cactggggct 


ccagattagt 


ggtaaaacct 


10560 


actttataca 


atcatagaat 


catagaatgg 


cctgggttgg 


aagggacccc 


aaggatcatg 


10620 


aagatccaac 


acccccgcca 


caggcagggc 


caccaacctc 


cagatctggt 


actagaccag 


10680 


gcagcccagg 


gctccatcca 


acctggccat gaacacctcc 


agggatggag 


catccacaac 


10740 


ctctctgggc 


agcctgtgcc 


agcacctcac caccctctct 


gtgaagaact 


tttccctgac 


10800 


atccaatcta 


agccttccct 


ccttgaggtt 


agatccactc 


ccccttgtgc 


tatcactgtc 


10860 


tactcttgta 


aaaagttgat 


tctcctcctt 


tttggaaggt 


tgcaatgagg 


tctccttgca 


10920 


gccttcttct 


cttctgcagg 


atgaacaagc 


ccagctccct 


cagcctgtct 


ttataggaga 


10980 


ggtgctccag 


ccctctgatc 


atctttgtgg 


ccctcctctg 


gacccgctcc 


aagagctcca 


11040 


catctttcct 


gtactggggg 


ccccaggcct 


gaatgcagta 


ctccagatgg 


ggcctcaaaa 


11100 


gagcagagta 


aagagggaca 


atcaccttcc 


tcaccctgct 


ggccagccct 


cttctgatgg 


11160 


agccctggat 


acaactggct 


ttctgagctg 


caacttctcc 


ttatcagttc 


cactattaaa 


11220 


acaggaacaa 


tacaacaggt 


gctgatggcc 


agtgcagagt 


ttttcacact 


tcttcatttc 


11280 


ggtagatctt 


agatgaggaa 


cgttgaagtt 


gtgcttctgc 


gtgtgcttct 


tcctcctcaa 


11340 


atactcctgc 


ctgatacctc 


accccacctg 


ccactgaatg 


gctccatggc 


cccctgcagc 


11400 


cagggccctg 


atgaacccgg 


cactgcttca 


gatgctgttt 


aatagcacag 


tatgaccaag 


11460 


ttgcacctat 


gaatacacaa 


acaatgtgtt 


gcatccttca 


gcacttgaga 


agaagagcca 


11520 


aatttgcatt 


gtcaggaaat 


ggtttagtaa ttctgccaat 


taaaacttgt 


ttatctacca 


11580 


tggctgtttt 


tatggctgtt 


agtagtggta cactgatgat 


gaacaatggc 


tatgcagtaa 


11640 


aatcaagact 


gtagatattg 


caacagacta 


taaaattcct 


ctgtggctta 


gccaatgtgg 


11700 


tacttcccac 


attgtataag 


aaatttggca agtttagagc 


aatgtttgaa 


gtgttgggaa 


11760 


atttctgtat


actcaagagg 


gcgtttttga caactgtaga


acagaggaat 


caaaaggggg 


11820

tgggaggaag ttaaaagaag aggcaggtgc aagagagctt gcagtcccgc tgtgtgtacg 11880
acactggcaa catgaggtct ttgctaatct tggtgctttg cttcctgccc ctggctgcct 11940 
taggg 

<210> 8 

<211> 285 

<212> DNA

<213> SV-40 

<220> 

<221> misc_feature
<222> (1)..(285) 

<223> SV40 Polyadenylation Sequence 
<400> 8 

aaagtctaga gtcggggcgg ccggccgctt cgagcagaca tgataagata cattgatgag 60 
tttggacaaa ccacaactag aatgcagtga aaaaaatgct ttatttgtga aatttgtgat 120 
gctattgctt tatttgtaac cattataagc tgcaataaac aagttaacaa caacaattgc 180 
attcatttta tgtttcaggt tcagggggag gtgtgggagg ttttttaaag caagtaaaac 240 
ctctacaaat gtggtaaaat cgataaggat ccgtcgagcg gccgc 285 
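
Each sequence in this listing is printed in rows of six 10-residue groups followed by a running residue count, and the `<211>` field states the total length. A minimal sketch of how a reader might check a transcribed record against those counts is given below, using the SV40 polyadenylation sequence above as input. The function name `parse_listing_rows` is hypothetical and not part of the listing; the sketch assumes lowercase residues and one running count at the end of every row.

```python
import re

# Rows of SEQ ID NO: 8 as printed above (groups of 10, running count at row end).
SEQ8_ROWS = """
aaagtctaga gtcggggcgg ccggccgctt cgagcagaca tgataagata cattgatgag 60
tttggacaaa ccacaactag aatgcagtga aaaaaatgct ttatttgtga aatttgtgat 120
gctattgctt tatttgtaac cattataagc tgcaataaac aagttaacaa caacaattgc 180
attcatttta tgtttcaggt tcagggggag gtgtgggagg ttttttaaag caagtaaaac 240
ctctacaaat gtggtaaaat cgataaggat ccgtcgagcg gccgc 285
"""


def parse_listing_rows(rows: str) -> str:
    """Join sequence rows into one string, checking each running count."""
    pieces = []
    total = 0
    for line in rows.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between rows
        match = re.fullmatch(r"([a-z ]+?)\s+(\d+)", line)
        if match is None:
            raise ValueError(f"unrecognised row: {line!r}")
        residues = match.group(1).replace(" ", "")
        total += len(residues)
        if total != int(match.group(2)):
            raise ValueError(
                f"running count {match.group(2)} does not match {total} residues"
            )
        pieces.append(residues)
    return "".join(pieces)


seq8 = parse_listing_rows(SEQ8_ROWS)
# The joined length should equal the <211> value declared for SEQ ID NO: 8.
assert len(seq8) == 285
```

A row whose residues were dropped or duplicated during transcription raises `ValueError` at the first count that no longer agrees, which localizes the damage to a single 60-residue row.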

<210> 9 

<211> 5972 

<212> DNA 

<213> Gallus gallus 

<220> 

<221> misc_feature

<222> (1)..(5972)

<223> Lysozyme 3prime domain 

<400> 9 

cgcgtggtag gtggcggggg gttcccagga gagcccccag cgcggacggc agcgccgtca 60 

ctcaccgctc cgtctccctc cgcccagggt cgcctggcgc aaccgctgca agggcaccga 120 

cgtccaggcg tggatcagag gctgccggct gtgaggagct gccgcgcccg gcccgcccgc 180 

tgcacagccg gccgctttgc gagcgcgacg ctacccgctt ggcagtttta aacgcatccc 240 

tcattaaaac gactatacgc aaacgccttc ccgtcggtcc gcgtctcttt ccgccgccag 300 

ggcgacactc gcggggaggg cgggaagggg gccgggcggg agcccgcggc caaccgtcgc 360 

cccgtgacgg caccgccccg cccccgtgac gcggtgcggg cgccggggcc gtggggctga 420 

gcgctgcggc ggggccgggc cgggccgggg cgggagctga gcgcggcgcg gctgcgggcg 480 






WO 03/024199 



PCT/US02/30156 



gcgccccctc cggtgcaata tgttcaagag aatggctgag ttcgggcctg actccggggg 
cagggtgaag gtgcggcgcg ggcggaggga cggggcgggc gcggggccgc ccggcgggtg 
ccggggcctc tgccggcccg cccggctcgg gctgctgcgg cgcttacggg cgcgcttctc 



gccgctgccg cttctcttct ctcccgcgca agggcgtcac catcgtgaag ccggtagtgt 
acgggaacgt ggcgcggtac ttcgggaaga agagggagga ggacgggcac acgcatcagt 

cgtgaagccc tacaggaacg aggtagggcc cgagcgcgtc ggccgccgtt 

agcgagcttc 
gagagctacg 
cccgctgagg 

ctcgtcgccc tcatctttct ttcgtgccgc agtcgttacc aaaccgccgt acgagatcac 



ggacggttta 
ctcggagcgc 
tctcgtagga 
ggaatcctct 



cggagccgtc agcgccgcgc ctgggtgcgc tgtgggacac 
catgtccgcc tacgtgaaaa aaatccagtt caagctgcac 
ccgaggtggg tgttgcgtcg gggggtttgc tccgctcggt 



cgaaacgggc tggggcgaat ttgaaatcat catcaagata tttttcattg 
gcgacccgta agtacgctca gcttctcgta gtgcttcccc cgtcctggcg 

ctgctgccgg tcacagtccc gccagccgcg gagctgactg 
gtgctctgtg ttcggtcagc gaggctatcg ggagggcttt 
gcgcttagcg caggagcacg ttgtgctacg cctgaactac 
aaccgctctc aaactgattt attggcgaaa tggctctaaa 



gggctgctcg 
cccgggacgt 
ggcttctctg 



aggccgtgga 

tcctctcttt ggaaatgctt tagagaaggt ctctgtggta gttcttatgc 
aagcacttgg ccagacaatt taaagacatc aagcagcatt tatagcaggc 
acgaatactg aatttaagta actctgctca cgttgtatga cgtttatttt 
aaagccatta 



ggagataact 
gtaaccctgt 



aaatcctgtg cagttgttta gtaagaacag ctgccactgt 
ggtgtttccc tacagttctc aagctgataa aactctgtct 
atcacttgct gaagcttttt cagtctgaca ccaatgcaat 



aaaactgtag tttctgaatt ctatgatgaa atggtatgaa aattttaatg 
ctgactttat ttaaaaaaaa ttattgatgg tgctgtgtat tttggtcctt 
ttcaagatcc tactgccatg atgcagcaac tgctaacgac gtcccgtcag 
gtgcttacaa gcatgaaaca gagtgtaagt gcaaaatgag gataccttcg
ttcactacta atgttttctg tgggatgtga tcgtacagtg agtttggctg
gaatagcttg gtattggcag tgatgacgtg atcgatgcct tgcttatcat 
aagtagaata aatgcagcct gctttatttg agatagtttg gttcatttta 



atccaaacga 
gcccggggct 
agctcccttt 
ggctgcattt 
agctgtgaga 
ctaaatcgtc 
atctatccta 
acgtttaata 
cgtattcctg 
tttgtatcta 
ttgtatctag 
cctgggaaag 
tcaaccgagc 
ccttagatat 
ctgacacttg 
ccgaccgtca 
tgtgaaattt 
gtttgaaatg 
tggaatgcaa 



540



600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220


gcaaagatta


tacttcctca 


ctgaattgca 


ctgtccaaag 


gtgtgaaatg tgtgyygatu 


2280


tggaggaccg 


tgaccgaggg 


acattggatc 


gctatctccc 


atttcttttg ctgttaccag 


2340 


ttcagatttt 


cttttcacct 


agtctttaat 



tcccagggtt 



ttgttttttc cttggtcata 


2400 


gtttttgttt 


ttcactctgg 


caaatgatgt 


tgtgaattac 


actgcttcag ccacaaaact 


2460 


gatggactga 


atgaggtcat 


caaacaaact 


tttcttcttc 


cgtatttcct tttttttccc 


2520 


ccacttatca 


tttttactgc 


tgttgttgag tctgtaaggc 


taaaagtaac tgttttgtgc 


2580 


tttttcagga 


cgtgtgcttt 


ccaaattact 


gccacatata 


taaagaaagg ttggaatttt 


2640 


aaagataatt 


catgtttctt 


Gttctttttt 


gccaccacag 


ttgcagatct tgaagtaaaa 


2700 


accagggaaa 


agctggaagc 


tgccaaaaag 


aaaaccagtt 


ttgaaattgc tgagcttaaa 


2760 


gaaaggttaa 


aagcaagtcg 


tgaaaccatc 


aactgcttaa 


agagtgaaat cagaaaactc 


2820 


gaagaggatg 


atcagtctaa 


agatatgtga 


tgagtgttga 


cttggcaggg agcctataat 


2880 


gagaatgaaa 


ggacttcagt 


cgtggagttg 


tatgcgttct 


ctccaattct gtaacggaga 


2940 


ctgtatgaat 


ttcatttgca 


aatcactgca 


gtgtgtgaca 


actgactttt tataaatggc 


3000 


agaaaacaag 


aatgaatgta 


tcctcatttt 


atagttaaaa 


tctatgggta tgtactggtt 


3060 


tatttcaagg 


agaatggatc 


gtagagactt 


ggaggccaga 


ttgctgcttg tattgactgc 


3120 


atttgagtgg 


tgtaggaaca 


ttttgtctat 


ggtcccgtgt 


tagtttacag aatgccactg 


3180 


ttcactgttt 


tgttttgtat 


tttacttttt 


ctactgcaac 


gtcaaggttt taaaagttga 


3240 


aaataaaaca 


tgcaggtttt 


ttttaaatat 


ttttttgtct 


ctatccagtt tgggcttcaa 


3300 


gtattattgt 


taacagcaag 


tcctgattta 


agtcagaggc 


tgaagtgtaa tggtattcaa 


3360 


gatgcttaag 


tctgttgtca 


gcaaaacaaa 


agagaaaact 


tcataaaatc aggaagttgg 


3420 


catttctaat 


aacttcttta 


tcaacagata agagtttcta 


gccctgcatc tactttcact 


3480 


tatgtagttg 


atgcctttat 


attttgtgtg tttggatgca 


ggaagtgatt cctactctgt 


3540 


tatgtagata 


ttctatttaa 


cacttgtact 


ctgctgtgct 


tagcctttcc ccatgaaaat 


3600 


tcagcggctg 


taaatccccc 


tcttcttttg tagcctcata 


cagatggcag accctcaggc 


3660 


ttataaaggc 


ttgggcatct 


tctttactgc tttgagattc 


tgtgttgcag taacctctgc 


3720 


cagagaggag 


aaaagcccca 


caaacctcat ccccttcttc


tatagcaatc agtattacta 


3780 


atgctttgag 


aacagagcac 


tggtttgaaa 


cgtttgataa 


ttagcattta acatggcttg 


3840 


gtaaagatgc 


agaactgaaa 


cagctgtgac 


agtatgaact 


cagtatggag acttcattaa 


3900 


gacaaacagc 


tgttaaaatc 


aggcatgttt 


cattgaggag 


gacggggcaa cttgcaccag 


3960


tggtgcccac


acaaatcctt 


cctggcgctg 


cagaccaatt 


tttctggcat 


tutgactgcc 


4020


gttgctgctg 



gtcacagaga 


gcaactattt 


ttatcagcca 


caggcaattt 


gcttgtagta 


4080 


ttttccaagt 


gttgtaggta 


agtataaatg 


catcggctcc 


agagcacttt 


gagtatactt 


4140 


attaaaaaca 


taaatgaaag 


acaaattagc 


tttgcttggg 


tgcacagaac 


atttttagtt 


4200 


ccagcctgct 


ttttggtaga 


agccctcttc 


tgaggctaga 


actgactttg 


acaagtagag 


4260 


aaactggcaa 


cggagctatt 


gctatcgaag 


gatccttgtt 


aacaaagtta 


atcgtctttt 


4320 


aaggtttggt 


ttattcatta 


aatttgcttt 


taagctgtag 


ctgaaaaaga 


acgtgctgtc


4380 


ttccatgcac 


caggtggcag 


ctctgtgcaa 


agtgctctct 


ggtctcacca 


gccttttaat 


4440 


tgccgggatt 


ctggcacgtc 


tgagagggct 


cagactggct 


tcgtttgttt 


gaacagcgtg 


4500 


tactgctttc 


tgtagacatg 


gccggtttct 


ctcctgcagc 


ttatgaaact 


gttcacactg 


4560 


aacacactgg aacaggttgc


ccaaggaggc 


cgtggatgcc 


ccatccctgg 


aggcattcaa 


4620


ggccaggctg 


gatgtggctc 


tgggcagcct 


ggtctggtgg 


ttggcgatcc 


tgcacatagc 


4680 


agcggggttg 


aaactcgatg 


atcactgtgg 


tccttttcaa 


cccaggctat 


tctatgattc 


4740 


tatgattcaa 


cagcaaatca 


tatgtactga 


gagaggaaac 


aaacacaagt 


gctactgttt 


4800 


gcaagttttg 


ttcatttggt 


aaaagagtca 


ggttttaaaa 


ttcaaaatct 


gtctggtttt 


4860


ggtgtttttt 


tttttttatt 


tattatttct 


ttggggttct 


ttttgatgct 


ttatctttct 


4920 


ctgccaggac 


tgtgtgacaa 


tgggaacgaa 


aaagaacatg 


ccaggcactg 


tcctggattg 


4980 


cacacgctgg 


ttgcactcag 


tagcaggctc 


agaactgcca 


gtctttccac 


agtattactt 


5040 


tctaaaccta attttaatag 


cgttagtaga 


cttccatcac 


tgggcagtgc 


ttagtgaatg 


5100 


ctctgtgtga 


acgttttact 


tataagcatg ttggaagttt 


tgatgttcct 


ggatgcagta 


5160 


gggaaggaca gattagctat 


gtgaaaagta 


gattctgagt 


atcggggtta 


caaaaagtat 


5220 


agaaacgatg agaaattctt 


gttgtaacta 


attggaattt 


ctttaagcgt 


tcacttatgc 


5280 


tacattcata 


gtatttccat 


ttaaaagtag 


gaaaaggtaa 


aacgtgaaat 


cgtgtgattt 


5340 


tcggatggaa 


caccgccttc 


ctatgcacct 


gaccaacttc 


cagaggaaaa 


gcctattgaa 


5400 


agccgagatt aagccaccaa 


aagaactcat 


ttgcattgga 


atatgtagta 


tttgccctct 


5460 


tcctcccggg taattactat 


actttatagg gtgcttatat 


gttaaatgag 


tggctggcac


5520


tttttattct 


cacagctgtg 


gggaattctg tcctctagga 


cagaaacaat 


tttaatctgt 


5580 


tccactggtg actgctttgt 


cagcacttcc 


acctgaagag 


atcaatacac 


tcttcaatgt 


5640 


ctagttctgc aacacttggc 


aaacctcaca 


tcttatttca 


tactctcttc 


atgcctatgc 


5700


ttattaaagc


aataatctgg 


gtaatttttg 


i„ i. j — ^ *- — _ 




5760


ccgtgtccca 


cctaaagctc 


aattcaggtc 


ctgaatctct 


tcaactctct atagctaaca 


5820 


tgaagaatct 


tcaaaagtta 


ggtctgaggg 


acttaaggct


aactgtagat gttgttgcct 


5880 


ggtttctgtg 


ctgaaggccg 


tgtagtagtt 


agagcattca 


acctctagaa gaagcttggc 


5940 


cagctggtcg 


acctgcagat 


ccggccctcg 


ag 




5972 



<210> 10
<211> 18391
<212> DNA
<213> Gallus gallus

<220>
<221> misc_feature
<222> (1)..(237)
<223> 5prime matrix (scaffold) attachment region (MAR)

<220>
<221> misc_feature
<222> (261)..(1564)
<223> 5prime matrix (scaffold) attachment region (MAR)

<220>
<221> misc_feature
<222> (1565)..(1912)
<223> 5prime matrix (scaffold) attachment region (MAR)

<220>
<221> misc_feature
<222> (1930)..(2012)
<223> 5prime matrix (scaffold) attachment region (MAR)

<220>
<221> misc_feature
<222> (2013)..(2671)
<223> Intrinsically curved DNA

<220>
<221> misc_feature
<222> (5848)..(5934)
<223> Transcription enhancer

<220>
<221> misc_feature
<222> (9160)..(9325)
<223> Transcription enhancer

<220>
<221> misc_feature
<222> (9326)..(9626)
<223> Negative regulatory element

<220>
<221> misc_feature
<222> (9621)..(9660)
<223> Hormone response element

<220>
<221> misc_feature
<222> (9680)..(10060)
<223> Hormone response element

<220>
<221> misc_feature
<222> (10576)..(10821)
<223> Chicken CR1 Repeat Sequence

<220>
<221> misc_feature
<222> (10926)..(11193)
<223> Chicken CR1 Repeat Sequence

<220>
<221> misc_feature
<222> (11424)..(11938)
<223> Lysozyme Proximal Promoter and Lysozyme Signal Peptide

<220>
<221> misc_feature
<222> (11946)..(12443)
<223> human interferon alpha 2b codon-optimized for expression in chickens

<220>
<221> misc_feature
<222> (12464)..(18391)
<223> Chicken Lysozyme 3prime domain



<400> 10 

tgccgccttc tttgatattc actctgttgt atttcatctc ttcttgccga tgaaaggata 60 

taacagtctg tataacagtc tgtgaggaaa tacttggtat ttcttctgat cagtgttttt 120 

ataagtaatg ttgaatattg gataaggctg tgtgtccttt gtcttgggag acaaagccca 180 

cagcaggtgg tggttggggt ggtggcagct cagtgacagg agaggttttt ttgcctgttt 240

tttttttttt


tttttttttt 


aagtaaggtg 


ttcttctttc 


ttagtaaatt


ttctactgga


300




ctgtatgttt 


tgacaggtca 


gaaacatttc 


ttcaaaagaa 


gaaccttttg 


gaaactgtac 


360 


agcccttttc 


tttcattccc 


tttttgcttt 


ctgtgccaat 


gcctttggtt 


ctgattgcat 


420 


tatggaaaac 


gttgatcgga 


acttgaggtt 


tttatttata 


gtgtggcttg 


aaagcttgga 


480 


tagctgttgt 


tacacgagat 


accttattaa 


gtttaggcca 


gcttgatgct 


ttattttttc 


540 


cctttgaagt 


agtgagcgtt 


ctctggtttt 


tttcctttga 


aactggtgag 


gcttagattt 


600 


ttctaatggg 


attttttacc 


tgatgatcta 


gttgcatacc 


caaatgcttg 


taaatgtttt 


660 


cctagttaac 


atgttgataa 


cttcggattt 


acatgttgta 


tatacttgtc 


atctgtgttt 


720 


ctagtaaaaa 


tatatggcat 


ttatagaaat 


acgtaattcc 


tgatttcctt 


tttttttatc 


780 


tctatgctct 


gtgtgtacag 


gtcaaacaga 


cttcactcct 


atttttattt 


atagaatttt 


840 


atatgcagtc 


tgtcgttggt 


tcttgtgttg 


taaggataca 


gccttaaatt 


tcctagagcg 


900 


atgctcagta 


aggcgggttg 


tcacatgggt 


tcaaatgtaa 


aacgggcacg 


tttggctgct 


960 


gccttcccga 


gatccaggac 


actaaactgc 


ttctgcactg 


aggtataaat 


cgcttcagat 


1020 


cccagggaag 


tgcagatcca 


cgtgcatatt 


cttaaagaag 


aatgaatact 


ttctaaaata 


1080 


ttttggcata 


ggaagcaagc 


tgcatggatt 


tgtttgggac 


ttaaattatt 


ttggtaacgg 


1140 


agtgcatagg 


ttttaaacac 


agttgcagca 


tgctaacgag 


tcacagcgtt 


tatgcagaag 


1200 


tgatgcctgg 


atgcctgttg 


cagctgttta 


cggcactgcc 


ttgcagtgag 


cattgcagat 


1260 


aggggtgggg 


tgctttgtgt 


cgtgttccca 


cacgctgcca 


cacagccacc 


tcccggaaca 


1320 


catctcacct 


gctgggtact 


tttcaaacca 


tcttagcagt 


agtagatgag 


ttactatgaa 


1380 


acagagaagt 


tcctcagttg 


gatattctca 


tgggatgtct 


tttttcccat 


gttgggcaaa 


1440 


gtatgataaa 


gcatctctat 


ttgtaaatta 


tgcacttgtt 


agttcctgaa 


tcctttctat 


1500 


agcaccactt 


attgcagcag 


gtgtaggctc 


tggtgtggcc 


tgtgtctgtg 


cttcaatctt 


1560 


ttaaagcttc 


tttggaaata 


cactgacttg 


attgaagtct 


cttgaagata 


gtaaacagta 


1620 


cttacctttg 


atcccaatga 


aatcgagcat 


ttcagttgta 


aaagaattcc 


gcctattcat 


1680 


accatgtaat 


gtaattttac 


acccccagtg 


ctgacacttt 


ggaatatatt 


caagtaatag 


1740 


actttggcct 


caccctcttg 


tgtactgtat 


tttgtaatag 


aaaatatttt 


aaactgtgca 


1800 


tatgattatt 


acattatgaa 


agagacattc 


tgctgatctt 


caaatgtaag 


aaaatgagga 


1860 


gtgcgtgtgc 


ttttataaat 


acaagtgatt 


gcaaattagt 


gcaggtgtcc 


ttaaaaaaaa 


1920 


aaaaaaaaag 


taatataaaa 


aggaccaggt 


gttttacaag 


tgaaatacat 


tcctatttgg 


1980


taaacagtta catttttatg


aagattacca 


gcgctgctga ctttctaaac






attgtcttcc tgtaccattg 


catttcctca 


ttcccaattt gcacaaggat 


gtctgggtaa 


2100 


actattcaag aaatggcttt 


gaaatacagc 


atgggagctt gtctgagttg 


gaatgcagag 


2160 


ttgcactgca aaatgtcagg 


aaatggatgt 


ctctcagaat gcccaactcc 


aaaggatttt 


2220 


atatgtgtat atagtaagca 


gtttcctgat 


tccagcaggc caaagagtct 


gctgaatgtt 


2280 


gtgttgccgg agacctgtat 


ttctcaacaa 


ggtaagatgg tatcctagca 


actgcggatt 


2340 


ttaatacatt ttcagcagaa 


gtacttagtt 


aatctctacc tttagggatc 


gtttcatcat 


2400 


ttttagatgt tatacttgaa 


atactgcata 


acttttagct ttcatgggtt 


cctttttttc 


2460 


agcctttagg agactgttaa 


gcaatttgct 


gtccaacttt tgtgttggtc 


ttaaactgca 


2520 


atagtagttt accttgtatt 


gaagaaataa 


agaccatttt tatattaaaa 


aatacttttg 


2580 


tctgtcttca ttttgacttg 


tctgatatcc 


ttgcagtgcc cattatgtca 


gttctgtcag 


2640 


atattcagac atcaaaactt 


aacgtgagct 


cagtggagtt acagctgcgg 


ttttgatgct 


2700 


gttattattt ctgaaactag 


aaatgatgtt 


gtcttcatct gctcatcaaa 


cacttcatgc 


2760 


agagtgtaag gctagtgaga 


aatgcataca 


tttattgata cttttttaaa 


gtcaactttt 


2820 


tatcagattt ttttttcatt 


tggaaatata 


ttgttttcta gactgcatag 


cttctgaatc 


2880 


tgaaatgcag tctgattggc 


atgaagaagc 


acagcactct tcatcttact 


taaacttcat 


2940 


tttggaatga aggaagttaa 


gcaagggcac 


aggtccatga aatagagaca 


gtgcgctcag 


3000 


gagaaagtga acctggattt 


ctttggctag 


tgttctaaat ctgtagtgag 


gaaagtaaca 


3060 


cccgattcct tgaaagggct 


ccagctttaa 


tgcttccaaa ttgaaggtgg 


caggcaactt 


3120 


ggccactggt tatttactgc 


attatgtctc 


agtttcgcag ctaacctggc 


ttctccacta 


3180 


ttgagcatgg actatagcct 


ggcttcagag 


gccaggtgaa ggttgggatg 


ggtggaagga 


3240 


gtgctgggct gtggctgggg 


cagggaaaag tgtgggtaac 


ggactgtggg 


gactccaagc tgagcttggg 


gtgggcagca 


3300 


tatttttaag 


tactgtgttg caaacgtctc 


atctgcaaat 


3360 


acgtagggtg tgtactctcg 


aagattaaca 


gtgtgggttc agtaatatat 


ggatgaattc 


3420 


acagtggaag cattcaaggg 



tagatcatct 


aacgacacca gatcatcaag 


ctatgattgg 


3480 


aagcggtatc agaagagcga 


ggaaggtaag 


cagtcttcat atgttttccc 


tccacgtaaa 


3540 


gcagtctggg aaagtagcac 


cccttgagca 


gagacaagga aataattcag 


gagcatgtgc 


3600 


taggagaact ttcttgctga 


attctacttg 


caagagcttt gatgcctggc 


ttctggtgcc 


3660 


ttctgcagca cctgcaaggc 


ccagagcctg 


tggtgagctg gagggaaaga 


ttctgctcaa 


3720


gcccaagctt


cagcaggtca 


ttgtctttgc 


ttcttccccc 


— — . ^— . — ^ * -4- 4— „ 


—i — \ r ^> j-v r r-w 




aactgatgtc 


gaagcctcct 


gtccactacc 


tgttgctgca 


ggcagactgc 


tctcagaaaa 


3840 


agagagctaa 


ctctatgcca 


tagtctgaag 


gtaaaatggg 


ttttaaaaaa 


gaaaacacaa 


3900 


aggcaaaacc 


ggctgcccca 


tgagaagaaa 


gcagtggtaa 


acatggtaga 


aaaggtgcag 


3960 


aagcccccag 


gcagtgtgac 


aggcccctcc 


tgccacctag 


aggcgggaac 


aagcttccct 


4020 


gcctagggct 


ctgcccgcga 


agtgcgtgtt 


tctttggtgg 


gttttgtttg 


gcgtttggtt 


4080 


ttgagattta 


gacacaaggg 


aagcctgaaa 


ggaggtgttg 


ggcactattt 


tggtttgtaa 


4140 


agcctgtact 


tcaaatatat 


attttgtgag 


ggagtgtagc 


gaattggcca 


atttaaaata 


4200 


aagttgcaag 


agattgaagg 


ctgagtagtt 


gagagggtaa 


cacgtttaat 


gagatcttct 


4260 


gaaactactg 


cttctaaaca 


cttgtttgag 


tggtgagacc 


ttggataggt 


gagtgctctt 


4320 


gttacatgtc


tgatgcactt 


gcttgtcctt 


ttccatrccac 


atccatgcat 


tccacatcca 


4380 


cgcatttgtc 


acttatccca 


tatctgtcat 


atctgacata 


cctgtctctt 


cgtcacttgg 


4440 


tcagaagaaa 


cagatgtgat 


aatccccagc 


cgccccaagt 


ttgagaagat 


ggcagttgct 


4500 


tctttccctt 


tttcctgcta 


agtaaggatt 


ttctcctggc 


tttgacacct 


cacgaaatag 


4560 


tcttcctgcc 


ttacattctg 


ggcattattt 


caaatatctt 


tggagtgcgc 


tgctctcaag 


4620 


tttgtgtctt 


cctactctta 


gagtgaatgc 


tcttagagtg 


aaagagaagg 


aagagaagat 


4680 


gttggccgca 


gttctctgat 


gaacacacct 


ctgaataatg 


gccaaaggtg 


ggtgggtttc 


4740 


tctgaggaac 


gggcagcgtt 


tgcctctgaa 


agcaaggagc 


tctgcggagt 


tgcagttatt 


4800 


ttgcaactga 


tggtggaact 


ggtgcttaaa 


gcagattccc 


taggttccct 


gctacttctt 


4860 


ttccttcttg 


gcagtcagtt 


tatttctgac 


agacaaacag 


ccacccccac 


tgcaggctta 


4920 


gaaagtatgt 


ggctctgcct 


gggtgtgtta 


cagctctgcc 


ctggtgaaag 


gggattaaaa 


4980 


cgggcaccat 


tcatcccaaa 


caggatcctc 


attcatggat 


caagctgtaa 


ggaacttggg 


5040 


ctccaacctc 


aaaacattaa 


ttggagtacg 


aatgtaatta 


aaactgcatt 


ctcgcattcc 


5100 


taagtcattt 


agtctggact 


ctgcagcatg 


taggtcggca 


gctcccactt 


tctcaaagac 


5160 


cactgatgga 


ggagtagtaa 


aaatggagac 


cgattcagaa 


caaccaacgg 


agtgttgccg 


5220 


aagaaactga 


tggaaataat 


gcatgaattg 


tgtggtggac 


atttttttta 


aatacataaa 


5280 


ctacttcaaa 


tgaggtcgga 


gaaggtcagt 


gttttattag 


cagccataaa 


accaggtgag 


5340 


cgagtaccat 


ttttctctac 


aagaaaaacg 


attctgagct 


ctgcgtaagt 


ataagttctc 


5400 


catagcggct 


gaagctcccc 


cctggctgcc 


tgccatctca 


gctggagtgc 


agtgccattt 


5460


ccttggggtt


tctctcacag 


cagtaatggg 


acaatacttc acaaaaattc


1- l- L. — J, 1-4.. l_ . 


5520 


tgtcatgtgg 


gatccctact 


gtgccctcct 



ggttttacgt taccccctga 


ctgttccatt 


5580 


cagcggtttg 


gaaagagaaa 

aagaatttgg


aaataaaaca tgtctacgtt


atcacctcct 


5640 


ccagcatttt 


ggtttttaat 


tatgtcaata 


actggcttag atttggaaat 


gagagggggt 


5700 


tgggtgtatt 


accgaggaac 


aaaggaaggc 


ttatataaac tcaagtcttt 


tatttagaga 


5760


actggcaagc 


tgtcaaaaac 


aaaaaggcct 


taccaccaaa ttaagtgaat 


agccgctata 


5820 


gccagcaggg 


ccagcacgag 


ggatggtgca 


ctgctggcac tatgccacgg 


cctgcttgtg 


5880 


actctgagag 


caactgcttt 


ggaaatgaca 


gcacttggtg caatttcctt 


tgtttcagaa 


5940 


tgcgtagagc 


gtgtgcttgg 


cgacagtttt 


tctagttagg ccacttcttt 


tttccttctc 


6000 


tcctcattct 


cctaagcatg 


tctccatgct 


ggtaatccca gtcaagtgaa 


cgttcaaaca 


6060 


atgaatccat 


cactgtagga 


ttctcgtggt 


gatcaaatct ttgtgtgagg 


tctataaaat 


6120 


atggaagctt 


atttattttt 


cgttcttcca 


tatcagtctt ctctatgaca 


attcacatcc 


6180 


accacagcaa 


attaaaggtg 


aaggaggctg 


gtgggatgaa gagggtcttc 


tagctttacg 


6240 


ttcttccttg 


caaggccaca 


ggaaaatgct 


gagagctgta gaatacagcc 


tggggtaaga 


6300 


agttcagtct 


cctgctggga 


cagctaaccg 


catcttataa ccccttctga 


gactcatctt 


6360 


aggaccaaat 


agggtctatc 


tggggttttt 


gttcctgctg ttcctcctgg 


aaggctatct 


6420 


cactatttca 


ctgctcccac 


ggttacaaac 


caaagataca gcctgaattt 


tttctaggcc 


6480 


acattacata 


aatttgacct 


ggtaccaata 


ttgttctcta tatagttatt 


tccttcccca 


6540 


ctgtgtttaa 


ccccttaagg 


cattcagaac 


aactagaatc atagaatggt 


ttggattgga 


6600 


aggggcctta 


aacatcatcc 


atttccaacc 


ctctgccatg ggctgcttgc 


cacccactgg 


6660 


ctcaggctgc 


ccagggcccc 


atccagcctg 


gccttgagca cctccaggga 


tggggcaccc 


6720 


acagcttctc 


tgggcagcct 


gtgccaacac 


ctcaccactc tctgggtaaa 


gaattctctt 


6780 


ttaacatcta 


atctaaatct 


cttctctttt 


agtttaaagc cattcctctt 


tttcccgttg 


6840 


ctatctgtcc 


aagaaatgtg 


tattggtctc 


cctcctgctt ataagcagga 


agtactggaa 


6900 


ggctgcagtg 


aggtctcccc 


acagccttct 


cttctccagg ctgaacaagc 


ccagctcctt 


6960 


cagcctgtct 


tcgtaggaga 


tcatcttagt 


ggccctcctc tggacccatt 


ccaacagttc 


7020 


cacggctttc 


ttgtggagcc 


ccaggtctgg 


atgcagtact tcagatgggg 


ccttacaaag 


7080 


gcagagcaga 


tggggacaat 


cgcttacccc 


tccctgctgg ctgcccctgt 


tttgatgcag 


7140 


cccagggtac 


tgttggcctt 


tcaggctccc 


agaccccttg ctgatttgtg 


tcaagctttt 


7200

catccaccag aacccacgct tcctggttaa tacttctgcc ctcacttctg taagcttgtt



tcaggagact tccattcttt aggacagact gtgttacacc tacctgccct 


atatacattt cagttcatgt ttcctgtaac aggacagaat atgtattcct 



attcttgcat 



ctaacaaaaa 



tacatgcaga attcctagtg ccatctcagt agggttttca tggcagtatt agcacatagt 
caatttgctg caagtacctt ccaagctgcg gcctcccata aatcctgtat ttgggatcag 
ttaccttttg gggtaagctt ttgtatctgc agagaccctg ggggttctga tgtgcttcag 
ctctgctctg ttctgactgc accattttct agatcaccca gttgttcctg tacaacttcc 
ttgtcctcca tcctttccca gcttgtatct ttgacaaata caggcctatt tttgtgtttg 
cttcagcagc catttaattc ttcagtgtca tcttgttctg ttgatgccac tggaacagga 
ttttcagcag tcttgcaaag aacatctagc tgaaaacttt ctgccattca atattcttac 
cagttcttct tgtttgaggt gagccataaa ttactagaac ttcgtcactg acaagtttat 
gcattttatt acttctatta tgtacttact ttgacataac acagacacgc acatattttg 
ctgggatttc cacagtgtct ctgtgtcctt cacatggttt tactgtcata cttccgttat 
aaccttggca atctgcccag ctgcccatca caagaaaaga gattcctttt ttattacttc 
tcttcagcca ataaacaaaa tgtgagaagc ccaaacaaga acttgtgggg 
tcaagggaga gacagctgaa gggttgtgta gctcaataga attaagaaat 
tgtcagacag ttttgcctga tttatacagg cacgccccaa gccagagagg 
aggccacctt gcagtccttg gtttgtaaga taagtcatag gtaacttttc 
cgtggagaat catgatggca gttcttgctg tttactatgg taagatgcta 
cagcaaagta acacttgctg ctgtaggtgc tctgctatcc agacagcgat 
caccaagatg agggatgctc ccagctgacg gatgctgggg cagtaacagt 



tcagaaggat 
ggcctgactg 

gatgcttaaa tgtggtcact gacatcttta tttttctatg atttcaagtc 



ctgcctgctc attagcatca cctcagccct caccagccca 
tgaggaaagt tgctcatctt cttcacatca tcaaaccttt 



gatcaggagg gaacacatag tgggaatgta ccctcagctc caaggccaga 
atgatcatgc atgctactta ggaaggtgtg tgtgtgtgaa tgtagaattg 
tttttcttcc tgctgtcagg aacattttga ataccagaga aaaagaaaag 
ggcatgggag gagttgtcac acttgcaaaa taaaggatgc agtcccaaat 
tcagggtctg aaggaggatc agaaactgtg tatacaattt caggcttctc 



caggctgcca 
aataaagctg 
ctgtctgcca 
tggtgaattg 
aaataggaga 
ggcactcgca 
gggtcccatg 
catcccaagc 
atgcctcccg 
agaacctccg 
tcttccttca 
cctttgttat 
tgctcttctt 
gttcataatc 
tgaatgcagc 



7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940

ttttgaaagc tgttcctggc cgaggcagta ctagtcagaa ccctcggaaa caggaacaaa

tgtcttcaag gtgcagcagg aggaaacacc ttgcccatca tgaaagtgaa taaccactgc 


cgctgaagga atccagctcc tgtttgagca ggtgctgcac actcccacac tgaaacaaca 
gttcattttt ataggacttc caggaaggat cttcttctta agcttcttaa ttatggtaca 
tctccagttg gcagatgact atgactactg acaggagaat gaggaactag ctgggaatat 
ttctgtttga ccaccatgga gtcacccatt tctttactgg tatttggaaa taataattct 
gaattgcaaa gcaggagtta gcgaagatct tcatttcttc catgttggtg acagcacagt 
tctggctatg aaagtctgct tacaaggaag aggataaaaa tcatagggat aataaatcta 
agtttgaaga caatgaggtt ttagctgcat ttgacatgaa gaaattgaga cctctactgg 
atagctatgg tatttacgtg tctttttgct tagttactta ttgaccccag ctgaggtcaa 
gtatgaactc aggtctctcg ggctactggc atggattgat tacatacaac tgtaatttta 
gcagtgattt agggtttatg agtacttttg cagtaaatca tagggttagt aatgttaatc 
tcagggaaaa aaaaaaaaag ccaaccctga cagacatccc agctcaggtg gaaatcaagg 
atcacagctc agtgcggtcc cagagaacac agggactctt ctcttaggac ctttatgtac 
agggcctcaa gataactgat gttagtcaga agactttcca ttctggccac agttcagctg 
aggcaatcct ggaattttct ctccgctgca cagttccagt catcccagtt tgtacagttc 
tggcactttt tgggtcaggc cgtgatccaa ggagcagaag ttccagctat ggtcagggag 
tgcctgaccg tcccaactca ctgcactcaa acaaaggcga aaccacaaga gtggcttttg
ttgaaattgc agtgtggccc agaggggctg caccagtact ggattgacca cgaggcaaca
ttaatcctca gcaagtgcaa tttgcagcca ttaaattgaa ctaactgata ctacaatgca 
atcagtatca acaagtggtt tggcttggaa gatggagtct aggggctcta caggagtagc 
tactctctaa tggagttgca ttttgaagca ggacactgtg aaaagctggc ctcctaaaga 
ggctgctaaa cattagggtc aattttccag tgcactttct gaagtgtctg cagttcccca 
tgcaaagctg cccaaacata gcacttccaa ttgaatacaa ttatatgcag gcgtactgct 
tcttgccagc actgtccttc tcaaatgaac tcaacaaaca atttcaaagt ctagtagaaa 
gtaacaagct ttgaatgtca ttaaaaagta tatctgcttt cagtagttca gcttatttat 
gcccactaga aacatcttgt acaagctgaa cactggggct ccagattagt ggtaaaacct 
actttataca atcatagaat catagaatgg cctgggttgg aagggacccc aaggatcatg 
aagatccaac acccccgcca caggcagggc caccaacctc cagatctggt actagaccag 



9000

9060 
9120
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
gcagcccagg gctccatcca acctggccat gaacacctcc agggatggag catccacaac 10740

ctctctgggc agcctgtgcc agcacctcac caccctctct gtgaagaact tttccctgac 10800 

atccaatcta agccttccct ccttgaggtt agatccactc ccccttgtgc tatcactgtc 10860 

tactcttgta aaaagttgat tctcctcctt tttggaaggt tgcaatgagg tctccttgca 10920 

gccttcttct cttctgcagg atgaacaagc ccagctccct cagcctgtct ttataggaga 10980 

ggtgctccag ccctctgatc atctttgtgg ccctcctctg gacccgctcc aagagctcca 11040 

catctttcct gtactggggg ccccaggcct gaatgcagta ctccagatgg ggcctcaaaa 11100 

gagcagagta aagagggaca atcaccttcc tcaccctgct ggccagccct cttctgatgg 11160 

agccctggat acaactggct ttctgagctg caacttctcc ttatcagttc cactattaaa 11220 

acaggaacaa tacaacaggt gctgatggcc agtgcagagt ttttcacact tcttcatttc 11280 

ggtagatctt agatgaggaa cgttgaagtt gtgcttctgc gtgtgcttct tcctcctcaa 11340 

atactcctgc ctgatacctc accccacctg ccactgaatg gctccatggc cccctgcagc 11400 

cagggccctg atgaacccgg cactgcttca gatgctgttt aatagcacag tatgaccaag 11460

ttgcacctat gaatacacaa acaatgtgtt gcatccttca gcacttgaga agaagagcca 11520 

aatttgcatt gtcaggaaat ggtttagtaa ttctgccaat taaaacttgt ttatctacca 11580 

tggctgtttt tatggctgtt agtagtggta cactgatgat gaacaatggc tatgcagtaa 11640 

aatcaagact gtagatattg caacagacta taaaattcct ctgtggctta gccaatgtgg 11700 

tacttcccac attgtataag aaatttggca agtttagagc aatgtttgaa gtgttgggaa 11760 

atttctgtat actcaagagg gcgtttttga caactgtaga acagaggaat caaaaggggg 11820 

tgggaggaag ttaaaagaag aggcaggtgc aagagagctt gcagtcccgc tgtgtgtacg 11880 

acactggcaa catgaggtct ttgctaatct tggtgctttg cttcctgccc ctggctgcct 11940 

tagggtgcga tctgcctcag acccacagcc tgggcagcag gaggaccctg atgctgctgg 12000 

ctcagatgag gagaatcagc ctgtttagct gcctgaagga taggcacgat tttggctttc 12060 

ctcaagagga gtttggcaac cagtttcaga aggctgagac catccctgtg ctgcacgaga 12120 

tgatccagca gatctttaac ctgtttagca ccaaggatag cagcgctgct tgggatgaga 12180 

ccctgctgga taagttttac accgagctgt accagcagct gaacgatctg gaggcttgcg 12240 

tgatccaggg cgtgggcgtg accgagaccc ctctgatgaa ggaggatagc atcctggctg 12300 

tgaggaagta ctttcagagg atcaccctgt acctgaagga gaagaagtac agcccctgcg 12360 

cttgggaagt cgtgagggct gagatcatga ggagctttag cctgagcacc aacctgcaag 12420 
agagcttgag gtctaaggag taaaaagtct agagtcgggg cggcgcgtgg taggtggcqy 12480

ggggttccca ggagagcccc cagcgcggac ggcagcgccg tcactcaccg ctccgtctcc 12540 

ctccgcccag ggtcgcctgg cgcaaccgct gcaagggcac cgacgtccag gcgtggatca 12600 

gaggctgccg gctgtgagga gctgccgcgc ccggcccgcc cgctgcacag ccggccgctt 12660 

tgcgagcgcg acgctacccg cttggcagtt ttaaacgcat ccctcattaa aacgactata 12720 

cgcaaacgcc ttcccgtcgg tccgcgtctc tttccgccgc cagggcgaca ctcgcgggga 12780

gggcgggaag ggggccgggc gggagcccgc ggccaaccgt cgccccgtga cggcaccgcc 12840 

ccgcccccgt gacgcggtgc gggcgccggg gccgtggggc tgagcgctgc ggcggggccg 12900 

ggccgggccg gggcgggagc tgagcgcggc gcggctgcgg gcggcgcccc ctccggtgca 12960 

atatgttcaa gagaatggct gagttcgggc ctgactccgg gggcagggtg aaggtgcggc 13020 

gcgggcggag ggacggggcg ggcgcggggc cgcccggcgg gtgccggggc ctctgccggc 13080 

ccgcccggct cgggctgctg cggcgcttac gggcgcgctt ctcgccgctg ccgcttctct 13140 

tctctcccgc gcaagggcgt caccatcgtg aagccggtag tgtacgggaa cgtggcgcgg 13200 

tacttcggga agaagaggga ggaggacggg cacacgcatc agtggacggt ttacgtgaag 13260 

ccctacagga acgaggtagg gcccgagcgc gtcggccgcc gttctcggag cgccggagcc 13320 

gtcagcgccg cgcctgggtg cgctgtggga cacagcgagc ttctctcgta ggacatgtcc 13380 

gcctacgtga aaaaaatcca gttcaagctg cacgagagct acgggaatcc tctccgaggt 13440 

gggtgttgcg tcggggggtt tgctccgctc ggtcccgctg aggctcgtcg ccctcatctt 13500 

tctttcgtgc cgcagtcgtt accaaaccgc cgtacgagat caccgaaacg ggctggggcg 13560 

aatttgaaat catcatcaag atatttttca ttgatccaaa cgagcgaccc gtaagtacgc 13620 

tcagcttctc gtagtgcttc ccccgtcctg gcggcccggg gctgggctgc tcgctgctgc 13680 

cggtcacagt cccgccagcc gcggagctga ctgagctccc tttcccggga cgtgtgctct 13740 

gtgttcggtc agcgaggcta tcgggagggc tttggctgca tttggcttct ctggcgctta 13800 

gcgcaggagc acgttgtgct acgcctgaac tacagctgtg agaaggccgt ggaaaccgct 13860 

ctcaaactga tttattggcg aaatggctct aaactaaatc gtctcctctc tttggaaatg 13920 

<:tttagagaa ggtctctgtg gtagttctta tgcatctatc ctaaagcact tggccagaca 13980

atttaaagac atcaagcagc atttatagca ggcacgttta ataacgaata ctgaatttaa 14040 

gtaactctgc tcacgttgta tgacgtttat tttcgtattc ctgaaagcca ttaaaatcct 14100 

gtgcagttgt ttagtaagaa cagctgccac tgttttgtat ctaggagata actggtgttt 14160 
ccctacagtt cccaagcrga taaaactctg cccLtgtauu [remainder of line illegible] 14220

gctgaagctt tttcagtctg acaccaatgc aatcctggga aagaaaactg tagtttctga 14280 

attctatgat gaaatggtat gaaaatttta atgtcaaccg agcctgactt tatttaaaaa 14340 

aaattattga tggtgctgtg tattttggtc cttccttaga tatttcaaga tcctactgcc 14400

atgatgcagc aactgctaac gacgtcccgt cagctgacac ttggtgctta caagcatgaa 14460 

acagagtgta agtgcaaaat gaggatacct tcgccgaccg tcattcacta ctaatgtttt 14520 

ctgtgggatg tgatcgtaca gtgagtttgg ctgtgtgaaa tttgaatagc ttggtattgg 14580 

cagtgatgac gtgatcgatg ccttgcttat catgtttgaa atgaagtaga ataaatgcag 14640

cctgctttat ttgagatagt ttggttcatt ttatggaatg caagcaaaga ttatacttcc 14700 

tcactgaatt gcactgtcca aaggtgtgaa atgtgtgggg atctggagga ccgtgaccga 14760 

gggacattgg atcgctatct cccatttctt ttgctgttac cagttcagat tttcttttca 14820 

cctagtcttt aattcccagg gttttgtttt ttccttggtc atagtttttg tttttcactc 14880 

tggcaaatga tgttgtgaat tacactgctt cagccacaaa actgatggac tgaatgaggt 14940 

catcaaacaa acttttcttc ttccgtattt cctttttttt cccccactta tcatttttac 15000 

tgctgttgtt gagtctgtaa ggctaaaagt aactgttttg tgctttttca ggacgtgtgc 15060 

tttccaaatt actgccacat atataaagaa aggttggaat tttaaagata attcatgttt 15120 

cttcttcttt tttgccacca cagttgcaga tcttgaagta aaaaccaggg aaaagctgga 15180 

agctgccaaa aagaaaacca gttttgaaat tgctgagctt aaagaaaggt taaaagcaag 15240 

tcgtgaaacc atcaactgct taaagagtga aatcagaaaa ctcgaagagg atgatcagtc 15300 

taaagatatg tgatgagtgt tgacttggca gggagcctat aatgagaatg aaaggacttc 15360 

agtcgtggag ttgtatgcgt tctctccaat tctgtaacgg agactgtatg aatttcattt 15420 

gcaaatcact gcagtgtgtg acaactgact ttttataaat ggcagaaaac aagaatgaat 15480 

gtatcctcat tttatagtta aaatctatgg gtatgtactg gtttatttca aggagaatgg 15540 

atcgtagaga cttggaggcc agattgctgc ttgtattgac tgcatttgag tggtgtagga 15600 

acattttgtc tatggtcccg tgttagttta cagaatgcca ctgttcactg ttttgttttg 15660 

tattttactt tttctactgc aacgtcaagg ttttaaaagt tgaaaataaa acatgcaggt 15720

tttttttaaa tatttttttg tctctatcca gtttgggctt caagtattat tgttaacagc 15780 

aagtcctgat ttaagtcaga ggctgaagtg taatggtatt caagatgctt aagtctgttg 15840 

tcagcaaaac aaaagagaaa acttcataaa atcaggaagt tggcatttct aataacttct 15900 
ttatcaacag ataagagttt ctagccctgc atctactttc acttatytag ttgatgcctt 15960

tatattttgt gtgtttggat gcaggaagtg attcctactc tgttatgtag atattctatt 16020 

taacacttgt actctgctgt gcttagcctt tccccatgaa aattcagcgg ctgtaaatcc 16080 

ccctcttctt ttgtagcctc atacagatgg cagaccctca ggcttataaa ggcttgggca 16140 

tcttctttac tgctttgaga ttctgtgttg cagtaacctc tgccagagag gagaaaagcc 16200

ccacaaacct catccccttc ttctatagca atcagtatta ctaatgcttt gagaacagag 16260 

cactggtttg aaacgtttga taattagcat ttaacatggc ttggtaaaga tgcagaactg 16320 

aaacagctgt gacagtatga actcagtatg gagacttcat taagacaaac agctgttaaa 16380 

atcaggcatg tttcattgag gaggacgggg caacttgcac cagtggtgcc cacacaaatc 16440

cttcctggcg ctgcagacca atttttctgg cattctgact gccgttgctg ctggtcacag 16500 

agagcaacta tttttatcag ccacaggcaa tttgcttgta gtattttcca agtgttgtag 16560 

gtaagtataa atgcatcggc tccagagcac tttgagtata cttattaaaa acataaatga 16620 

aagacaaatt agctttgctt gggtgcacag aacattttta gttccagcct gctttttggt 16680 

agaagccctc ttctgaggct agaactgact ttgacaagta gagaaactgg caacggagct 16740 

attgctatcg aaggatcctt gttaacaaag ttaatcgtct tttaaggttt ggtttattca 16800 

ttaaatttgc ttttaagctg tagctgaaaa agaacgtgct gtcttccatg caccaggtgg 16860 

cagctctgtg caaagtgctc tctggtctca ccagcctttt aattgccggg attctggcac 16920 

gtctgagagg gctcagactg gcttcgtttg tttgaacagc gtgtactgct ttctgtagac 16980 

atggccggtt tctctcctgc agcttatgaa actgttcaca ctgaacacac tggaacaggt 17040 

tgcccaagga ggccgtggat gccccatccc tggaggcatt caaggccagg ctggatgtgg 17100 

ctctgggcag cctggtctgg tggttggcga tcctgcacat agcagcgggg ttgaaactcg 17160 

atgatcactg tggtcctttt caacccaggc tattctatga ttctatgatt caacagcaaa 17220 

tcatatgtac tgagagagga aacaaacaca agtgctactg tttgcaagtt ttgttcattt 17280 

ggtaaaagag tcaggtttta aaattcaaaa tctgtctggt tttggtgttt tttttttttt 17340 

atttattatt tctttggggt tctttttgat gctttatctt tctctgccag gactgtgtga 17400 

caatgggaac gaaaaagaac atgccaggca ctgtcctgga ttgcacacgc tggttgcact 17460 
cagtagcagg ctcagaactg ccagtctttc cacagtatta ctttctaaac ctaattttaa 17520 
tagcgttagt agacttccat cactgggcag tgcttagtga atgctctgtg tgaacgtttt 17580 
acttataagc atgttggaag ttttgatgtt cctggatgca gtagggaagg acagattagc 17640
tatgtgaaaa gtagatcctg ag Latcgggg [remainder of line illegible] 17700
cttgttgtaa ctaattggaa tttctttaag cgttcactta tgctacattc atagtatttc 17760
catttaaaag taggaaaagg taaaacgtga aatcgtgtga ttttcggatg gaacaccgcc 17820
ttcctatgca cctgaccaac ttccagagga aaagcctatt gaaagccgag attaagccac 17880
caaaagaact catttgcatt ggaatatgta gtatttgccc tcttcctccc gggtaattac 17940
tatactttat agggtgctta tatgttaaat gagtggctgg cactttttat tctcacagct 18000
gtggggaatt ctgtcctcta ggacagaaac aattttaatc tgttccactg gtgactgctt 18060
tgtcagcact tccacctgaa gagatcaata cactcttcaa tgtctagttc tgcaacactt 18120
ggcaaacctc acatcttatt tcatactctc ttcatgccta tgcttattaa agcaataatc 18180
tgggtaattt ttgttttaat cactgtcctg accccagtga tgaccgtgtc ccacctaaag 18240
ctcaattcag gtcctgaatc tcttcaactc tctatagcta acatgaagaa tcttcaaaag 18300
ttaggtctga gggacttaag gctaactgta gatgttgttg cctggtttct gtgctgaagg 18360
ccgtgtagta gttagagcat tcaacctcta g 18391



<210> 11 
<211> 586 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> MDOT artificial promoter 
<400> 11 

gtaccgggcc ccccctcgag gtgaatatcc aagaatgcag aactgcatgg aaagcagagc 60 
tgcaggcacg atggtgctga gccttagctg cttcctgctg ggagatgtgg atgcagagac 120 
gaatgaagga cctgtccctt actcccctca gcattctgtg ctatttaggg ttctaccaga 180 
gtccttaaga ggtttttttt ttttttggtc caaaagtctg tttgtttggt tttgaccact 240 
gagagcatgt gacacttgtc tcaagctatt aaccaagtgt ccagccaaaa tcgatgtcac 300 
aacttgggaa ttttccattt gaagcccctt gcaaaaacaa agagcacctt gcctgctcca 360 
gctcctggct gtgaagggtt ttggtgccaa agagtgaaag gcttcctaaa aatgggctga 420 
gccggggaag gggggcaact tgggggctat tgagaaacaa ggaaggacaa acagcgttag 480
gtcattgctt ctgcaaacac agccagggct gctcctctat aaaaggggaa gaaagaggct 540 
ccgcagccat cacagaccca gaggggacgg tctgtgaatc aagctt 586 

<210> 12 
<211> 11
<212> PRT 

<213> Artificial sequence 
<220> 

<223> SV40 terminator 
<400> 12 

Cys Gly Gly Pro Lys Lys Lys Arg Lys Val Gly 
1               5                   10



<210> 13 

<211> 12 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer SaltoNotI 

<400> 13 
tcgagcggcc gc 



<210> 14 
<211> 83 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 14
atggctttga cctttgcctt actggtggct ctcctggtgc tgagctgcaa gagcagctgc 60
tctgtgggct gcgatctgcc tca

<210> 15 
<211> 100 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 15
gacccacagc ctgggcagca ggaggaccct gatgctgctg gctcagatga ggagaatcag 60

cctgtttagc tgcctgaagg ataggcacga ttttggcttt

<210> 16 
<211> 62 
<212> DNA 
<213> Artificial Sequence
<220> 

<223> primer used in the formation

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 16
ctcaagagga gtttggcaac cagtttcaga aggctgagac catccctgtg ctgcacgaga 60
tg 62

<210> 17 
<211> 94 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 17
tccagcagat ctttaacctg tttagcacca aggatagcag cgctgcttgg gatgagaccc 60

tgctggataa gttttacacc gagctgtacc agca 

<210> 18 
<211> 77 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 18

ctgaacgatc tggaggcttg cgtgatccag ggcgtgggcg tgaccgagac ccctctgatg 60

aaggaggata gcatcct 

<210> 19 
<211> 82 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 



<400> 19 



gctgtgagga agtactttca gaggatcacc ctgtacctga aggagaagaa gtacagccct 60 
tgcgcttggg aagtcgtgag gg 



<210> 20 
<211> 65 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 



<400> 20

ctgagatcat gaggagcttt agcctgagca ccaacctgca agagagcttg aggtctaagg 60
agtaa 65



<210> 21 
<211> 34 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 21
cccaagcttt caccatggct ttgacctttg cctt 34

<210> 22 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 



<400> 22 

atctgcctca gacccacag 

<210> 23 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid

<400> 23 

gattttggct ttcctcaaga ggagtt 26

<210> 24 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 
<400> 24

gcacgagatg atccagcaga t 

<210> 25 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 25 

atcgttcagc tgctggtaca 

<210> 26 
<211> 20 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 26 

cctcacagcc aggatgctat 

<210> 27 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 27 

atgatctcag ccctcacgac 

<210> 28 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 28 

ctgtgggtct gaggcagat 

<210> 29 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 
<223> primer used in the formation

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 29 

aactcctctt gaggaaagcc aaaatc 

<210> 30 
<211> 21 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 30 

atctgctgga tcatctcgtg c 

<210> 31 
<211> 36 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> primer used in the formation 

of the chicken codon optimized human interferon 
2b-encoding nucleic acid 

<400> 31 

tgctctagac tttttactcc ttagacctca agctct 



<210> 32 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the synthesis of the MDOT promoter
<400> 32 

tcactcgagg tgaatatcca agaat 

<210> 33 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer used in the synthesis of the MDOT promoter
<400> 33

gagatcgatt ttggctggac acttg 

<210> 34 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 
<223> primer used in the synthesis of the MDOT promoter



<400> 34 

cacatcgatg tcacaacttg ggaat 25 

<210> 35 
<211> 25 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer used in the synthesis of the MDOT promoter 
<400> 35 

tctaagcttc gtcacagacc gtccc 25 